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Editors” Note 


This issue of Papers & Reports on Child Language Development contains the 
proceedings of the Twenty-first Annual Child Language Research Forum, held at Stan- 
ford, April 7-9, 1989. We thank all the members of the CLRF committee whose hard 
work made the Forum possible; Lila R. Gleitman (University of Pennsylvania) for giving 
the Keynote Address; Jane Edwards, Kenji Hakuta, Ann Peters, and Richard Schwartz for 
organizing and leading Special Interest Groups; and all the participants who contributed 
so much to making the meeting @ success. 


Next year’s Forum, on April 6-8, 1990, will be coordinated by Miriam Butt and 
Tracy King. The Keynote Address will be given by Elinor Ochs (University of Southem 
California) on "Cultural Universals in the Acquisition of Language.” Abstracts (limit 
250 words) will be due on January 10, 1990, Abstracts and enquiries should be addressed 
to Coordinators/CLRF-90, Department of Linguistics, Stanford University, Stanford, CA 
94305. 


Aucust 1989 Eve V. Clark 
Elizabeth Owen Bratt 
Sik Lee Cheung 
Bonnie McElhinny 


Previous issues of PRCLD: Anticles from previous issues of Papers and Reports on 
Child Language Development are generally available through the ERIC Document 
Reproduction Ser ice, 3900 Wheeler Avenue, Alexandria, Virginia 22304, U.S.A. 
(Many articles that appeared in preliminary form in PRCLD have since been published in 
professional journals in the field.) 
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Keynote Address 


The structural sources of verb meaning* 


Lila Gleitman 


If we will observe how children learn languages, we 
will find that, to make them understand what the names 
of simple ideas or substances stand for, people 
ordinarily show them the thing whereof they would have 
them have the idea: and then repeat to them the name 
that stands for it, as ‘white', ‘sweet', ‘milk', 
'sugar', ‘cat,' 'dog'. (Locke, 1690, Book 3.1X.9) 


Is vocabulary acquisition as straightforward as Locke 
supposes? Three hundred years after the publication of the 
Es Unde Locke's is still the dominant 
position on this topic for the very good reason that common 
sense insists that he was right: Word meanings are learned by 
noticing ‘she real-world contingencies for their use. For 
instance, it seems obvious to the point of banality that the 
verb pronounced /run/ is selected as the item that means ‘run! 
because this is the verb that occurs most reliably in the 
presence of running-events. 


Or is it? Who has ever looked to see? Ona trouble with 
questions whose answers are self-evident is that investigators 
rarely collect the evidence to see if they pan out in practice. 


Since this occasion of a keynote address is a serious one, 
I certainly am not going to try to defeat the obviously correct 
idea that a crucial source of evidence for learning word mean- 
ings is observation of the environmental conditions for their 
use. I believe, however, that what is correct about such a 
position is by no means obvious, and therefore deserves serious 
study rather than acceptance as a background fact in our field. 


I'll limit the discussion to the topic of acquiring verb 
meanings, because this is where I and ny colleagues have some 
experimental evidence to offer in support of the position I want 
to adopt. Even within this subtopic, to begin at all If will 
have to make critical assumptions about some heady issues which 
deserve study in their own right. Particularly, I will not ask 
where the concepts that verbs encode come from in the first 
place, for example, how the child comes to conceive of such 
notions as 'run' (or 'think' or 'chase'). I want to look at the 
learner at a stage when he or she can entertain such ideas, how- 
ever this stage was arrived at. Second, I reserve for later 
discussion the question of how the child determines which word 
in the heard sentence is the verb -~ that it is the phonological 
object /run/, not /horse/ or /marathoner/, that is to be mapped 
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onto the action concept. 


The topic that remains seems a very small one: How does 
the learner decide which particular phonological osbiect 
corresponds to which particular verb concept, just Locke's 
topic. But I'll try to convince you that this question is 
harder than it looks. For one thing, matching the meanings to 
their sounds is the one part of acquisition that can't have any 
very direct “innate” support; this is because the concept ‘run' 
isn't paired with the sound /run/ in Greek or Urdu, so thn? 
relation must be learned by raw exposure to a specific languag%. 
For another thing, and as I'll try to convince you today, it's 
not clear at all that the required pairings are available to 
learners from their ambiant experience of words and the world. 


In the first half of this talk, I'll try to set out some of 
the factors that pose challenges to the idea that children can 
induce the word meanings from their contexts in the Sense Locke 
and his descendents in developmental psycholinguistics seem to 
have in mind. In this discussion, I will allude repeatedly to 
the work and theorizing of Steve Pinker, because he seems to me 
to be the most serious and acute modern interpreter of ideas 
akin to Locke's in relevant regards. Then, in response to 
these challenges to the theory of learning by observation, I 


will sketch a revised position laid out by Landau and Gleitman. 


(1985), illustrating it with some recent experimental evidence 
from our laboratory. The idea here is that, to a very con~ 
siderable extent, children deduce the verb meanings by consider- 
ing their syntactic privileges of occurrence. They must do so, 
because there is not enough information in the whole world to 
learn the meaning of even simple verbs. 


Part I: Some difficulties of learning by observation 


's idea: Differences in experience should yield differ = 
ences _in meanings 


At peril of carricaturing Locke -- but who doesn't? -- I 
select him as one who argued for a rather direct relation 


between knowledge and the experience of the senses. He fre- 
quently used the example of individuals born without sight as a 
testing ground for such a position. According to Locke, 


sighted and blind people ought both to be able to learn the 
meanings of such words as statue and fee] and sweet, but the 
blind ought to be unable to acquire picture and see and red, for 
these concepts are primitive (i.e., not derivable from other 
concepts) or derivable from primitives that are available only 
to the eye. 


Barbara Landau and I were directly inspired by Locke to 
study the acquisition of these vision-related terms by blind 
babies (Landau and Gleitman, i985). As our studies evolved, we 
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realized that exactly the same conceptual issues about learning 
arise for sighted vocabulary learners as for blind ones, so I 
will move on to discussion of such normally endowed children. 
The blind population, which I discuss first, is perhaps special 
only as the biographical point of origin of our own thinking but 
I suspect that, for you listeners too, it will serve to drama- 
tize some issues which seem less startling in the ordinary case. 
These have to do with how resistant the word-learning function 
is to the evidence of the senses. 


Landau and I were astonished to discover how much alike were 
the representations of vision-related terms by blind and 
sighted children at age 2 1/2 or so, despite what would appear 
to be radical differences in their observational opportunities. 
For instance, all these babies showed by their comprehension 
performances that they took look and gee as terms of perception, 
distinct from such contact terms as touch. As an example of 
this, a blind child told to "Touch but don't look at..." a 
table would merely bang or tap it. Whereas if told "Now you can 
leok at it" she explored all its surfaces systematically with 
her hands. Moreover, she understood leok to be the active (or 
exploratory) and see the stative (or achievement) term in this 
pair. Just as surprising, blind children as well as sighted 
children understood that green was an attribute predicable only 
of physical objects (they asserted that ideas could not in 
principle be green while cows might be, for all they knew). 
Thus the first principle that a theory of observational learning 
must be subtle enough to capture is that 


(i) The same semantic generalizations can be acquired in 
relative indifference to differing e1vironmental 
experience, if the notion "experience" is cast in 
sensory-perceptual terms. 


While we found the surprising result that blind children 
shared much knowledge about vision-related terms with their 
sighted peers, we also achieved the unsurprising result that 
there were some differences in how these two populations under- 
stood these terms to refer to their own perceptions: Blind 
children think that look and see describe their own haptic per- 
ceptions while sighted children think these same words describe 
their own visua] perceptions. Thus blindfolded sighted children 
of 3 years look skyward if told to "Look up!" but a »lind child 
of the same age holds its head immobile and searches the space 


1G) 


S) 
ERIC 


above in response to the same command (see Figures 1 and 2).+ 

This oWtcome is of just the sort that is subject to "ob- 
vious" explanations involving the extralinguistic contexts of 
use. We reasoned (as does everyone to whom one presents this 
set of facts): '‘Obviously,' a blind child's caretaker will use 
the terms look and see intending the child to perceive in 
whatever ways her sensorium makes available. And since the 
blind child's way of discovering the nature of objects is by 
exploring them manually, the caretaker will surely use look and 
see to this child only when an object is near enough to exp.ore 
manually. That is, the caretaker should say "Look at this boot" 
to her blind baby only if a boot is nearby, ready to be explored 
manually. The contexts of use for these words thus should in- 
clude -- among many other properties -- conversationaily perti- 
nent objects that are near at hand. Had the caretaker instead 
rattled a boot noisily by the child's ear whenever she said 
"Look at this boot", the learner would have surmised that look 
means ‘listen’. 


So here we have a straightforward prediction from the envi- 
ronment of use to the formation of a semantic conjecture: By 
hypothesis, the blind learner surmises that look involves hap- 
tic exploration because it is that verb which is used most reli- 
ably in contexts in which haptic exploration is possible and 
pertinent to the adult/child discourse. Landau and I decided 
to test that prediction to see if it was as true as it was 
obvious, 


To do so we examined videotapes of a mother and her blind 
child recorded in the period before the child uttered any vis- 
jon-related words or indeed any verbs at all (that is to say, 
during the learning period for these words), coding all verb 
uses according to whether they occurred when an object pertinent 
to the conversation (a) was NEAR enough to tne child for her to 
explore it manually, i.e., within arm's reach, (b) was FARther 
away than that, or (c) when there was NO such pertinent OBJECT. 
We hypothesized that look and see were the verbs used most reli- 
iably in the NEAR condition accounting for why the child had 


1 A related difference holds for the color words. Sighted 
children of four and five map the color words onto observed hues 
in the world while blind children ask for help. Perhaps they 
think the property is stipulative. Asked "Why are the flowers 
in the woods pink?" one blind child responded "Because we name 
them pink!" (Landau, personal communication). They know these 
are attributes predicable only of physical objects (they say 
that an idea can't be green because "it's only in your head") 
but they don't know what the real-world dimension may be. 
Interestingly, they avoid some choices that their extralinguis- 
tic experience appears to make available, e.g., that color terms 
refer to sizes of objects (Landau and Gleitman, ibid, ch. 8). 
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Figure 1: A blindfolded sighted child's response to the command Figure 1: A blind ctild's response to the command 
“Look up!“ (from Landau and Gleitman, 1985} "Look up!" (from Landat and Gleitman, 1985) 
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assigned them the meanings ‘explore/apprehend manually' (while 
other verbs would be used less often in this condition, ane so 
would not be assigned a haptic component in their meanings). 


The results are shown in Table 1. They fail to account for 
the child's haptic interpretation of look and see. Put id 
give and hold are the verbs used most reliably (over 95% 0” ue 
time) under the NEAR condition while look (73%) and espec J 
see (39%) are not as reliably associated with this condit.on. 
We can conclude that 


(ii) If representations of the environmental contexts are 
the basis for the semartic conjectures, these can't 
can't be just the simplest and most obvious represen- 
tations of those contexts that one can think of. 


It is worth pointing out before leaving tuis topic that the 
analysis of Table 1 cannot be written off as of some environmen- 
tal property that is hopelessly irrelevant to the child's 
analysis of events (though it is doubtless too simple, a fact to 
which I will return directly). Yor as it stands, this analysis 
extracts and explains important distinctions among verbs of 
physical motion that are in other respects semantically close, 
such as give vs get. The child is apparently told, sensibly 
enough, to give what she has in hand (this verb is used in the 
NEAR condition 97% of the time) but to GET what she doesn't have 
(the relevant NEAR percentage for this verb is 45%). 


Latitude of the hypothesis space 


Generalization (ii) brings me closer to topics I want to 
concentrate on today. Notice that the conclusion drawn 
was very weak -- not that it wasn't the contexts that led to the 
learning, but rather that the idea of "real-world context,” to 
succeed, must be a good deal more subtle than we (and others) 
originally supposed. That is, the response to the findings 
shown in Table 1 is usually, and perhaps should be: 


"Oh, but the contextual analysis you imposed was so feeble. 
Showing that it failed is only showing the failure of Landau and 
Gleitman's imagination. The child surely imposes a richer ana- 
lysis on the situation than that, and the only analysis relevant 
to the hypotheses under test is the one that the child herseif 
imposes." 


Fair enough. We limited the child to observing some 
perceptually obvious features of the situation, features that 
the infancy literat.re tells us are available even to babies. 
In other words, our aim was to see how far some small and 
independently documented set of observational primitives coud 
get the learner in extracting simple meaning features for 
assignments to the verbs. These were that the world is 
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Proportion used in contexts 


In hand Total number 

Verb ornear Far No ocbject considered* 
Perceptual verbs 

Look 73 09 18 34 

See 39 56 0s 18 

Other perceptual 56 44 00 17 
Nonperceptual verbs 

Come OS 32 oc 19 

Get 0 25 25 27 

Give 97 03 f& 21 

Go $2 24 24 20 

Have 53 A7 00 11 

Hold 1.00 .0O 00 10 

Play 70 .00 30 10 

97 00 03 61 
Say 43 07 SO 28 


a. These total to N=276, the number of utterances containing the common verbs 
(10 or more occurrences in the maternal corpus). The remaining 389 were dis- 
carded in this and following analyses, including 183 instances of be and 186 in- 
stances of rare verbs (fewer than 10 occurrences). 


Table 1: Situational contexts for the common verbs used by the 
blind child's mother during the learning period (from 
Landau and Gleitman, 1985) 
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populated with objects which endure over time (Spelke, 1982), 
and which move relative to each other (Lasky & Gogol, 1978) and 
with respect to the positions of the child's own body (Acredolo 
and Evans, 1980; Field, 1976). These assumptions put the child 
in a position to conceive of the situation as one of objects-~- 
in this case, objects whose noun names are known to the child-- 
moving (as described by the verb) between sources and goals. 
For example, for give the object moves from NEAR as action 
begins tg FAR when it ends, and in get the object goes from FAR 
to NEAR. 


It can hardly be denied, in light of the infancy evidence, 
that youngsters do represent situations in terms of the posi- 
tions and motions of pertinent objects. What is surely false, 
however, is that such categories are exhaustive amongst the 
child's extralinguistic analyses. Infants come richly pre- 
pared with means for picking up information about what is going 
on in their environment -- looking, listening, feeling, tasting, 
and smelling; in fact these different Sensory routes appear to 
be precoordinated for obtaining information about the world 
(Spelke, 1979). To take a few central examples, infants per- 
ceive the world as furnished with objects which are unitary, 
bounded, and persist over time and space (see Gibson and Spelke, 
1983), and which cannot occupy two places at one time (Baillar- 


geon, Spelke, and Wasserman, 1985). They distinguish among the - 


varying properties of objects, e.g., their rigidity or elas- 
ticity (Gibson and Walker, 1984), their size (Golinkoff et al, 
1984a), their colors (Bornstein, 1975), whether they are moving 
or stationary (Ball and Vurpillot, 1976), their positions and 
motions relative to the child observer (Field, 1976), their 
animacy (Golinkoff et al, 1984b) and even their numerosity 
(Starkey, Gelman, and Spelke, 1983). If you think there's 
something that infants can't or won't notice, look in the next 
issue of Developmental] Psychology and you will probably discover 
that someone proved they can. 


Now that I have acknowledged something of the richness of 
infant perception, why not let the learner recruit this consi- 
derable armamentarium for the sake of acquiring a verb vocabula- 
ry? That is, why not assume that the child encodes the situa- 
tion not only in the restricted terms that yield Table 1, but in 
myriad other ways? for instance, over the discourse as a whole, 
probably the mother has different aims in mind when she tells 
the child to "look at" some object than when she tells her to 


2 We hasten to say such an analysis can succeed at all 
only if the child can determine the discourse addressee. This 
assumption is plausible because (1) in these transcripts, at 
least, the mother's speech is over 95% about the "here and now" 
na (2) in over 90% of instances, the addressee is the child 

erself. 
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"hold" or "give" it. |The child could code the perceptual world 
for these perceived aims and enter these properties as aspects 
of the words' meanings. But also the mother may be angry or 
distant or lying down or eating lunch and the object in motion 
may be furry or alive or large or slimy or hot, and the child 
may code for these properties of the situation as well, entering 
them too as facets of the words’ meanings. 


The problems implicit in such an expansion of the represen- 
tat onal vecabulary should be familiar from the literature on 


syntax acquisition: The trouble is that an observer who 
notices everything can learn nothing, for there is no end of 


categories known and constructable to describe a situation. 


Indeed, not only learnability theorists but all syntac- 
ticians in the generative tradition appeal to the desireability 
of “narrowing the hypothesis space" lest the child be so over- 
whelmed with representational options and Gata-manipulative 
capacity as to be lost in thought forever. At least, learning 
of syntax could not be as rapid and uniform as it appears to 
be, unless the child were subject to highly restrictive princi- 
ples of Universal Grammar, which rein in her hypotheses. As one 
famous example, the learner is said to assume that all syntac- 
tic generalizations are structure-dependent rather than serial- 
order dependent (Chomsky, 1975; see also Crain and Fodor, in 
press). In fact, Universal Grammar is said to be as constrained 
as it is owing to the child's requirement that this be so. 


I put it to you: Are these observations about the aiffi- 
culties of learning when the hypothesis space is vast no less 
true of word learning than of syntax? In the domain of 
vocabulary acquisition as much as that of syntax acquisition, 
there is remarkable efficiency and systematicity of learning 
across individuals (and, as the blind children show, across 
learning environments): The rapidity and accuracy of vocabulary 
acquisition are jewels in the crown of rationalistically ori- 


3 As se often, Chomsky (1982) sets the problem with great 
clarity: "...The claim we're making about primitive notions is 
that if data were presented in such a way that these primitives 
couldn't be applied to it directly, prelinguistically before you 
have a grammar, then language couldn't be learnt...And the more 
unrealistic it is to think of concepts as having those proper- 
ties, the more unrealistic it is to regard them as primitives 
...We have to assume that there are some prelinguistic notions 
which can pick out pieces of the world, say elements of this 
meaning and of this sound." The analysis of Table 1 is an 
attempt to see how far some small and independently documented 
set of observational primitives could get the learner in ex- 
tracting a simple meaning feature ("haptic') for assignment to 
certain verbs. 
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1 psycholinguistics (see particularly Carey, 
1982). So just as in the case of syntax, we have initial 
grounds for claiming that a limit on the hypothesis space must 
be a critical source of sameness 

Bolistering the same view, languages seen to be as alike in their 
elementary vocabularies as they are 
(see for example Talmy, 1975; 1985). But surprisingly enough, 
all the telling arguments, invoked for syntax, to restrict the 
interpretation of the input -- that is, constraints on represen~ 
tations -- that are to explain these samenesses in form, con- 
tent, and learning functions, ace thrown out the window in most 
theorizing about the lexicon. There it is usually maintained 
that the child considers many complex, varying, cross-cuttinc, 
subtle conjectures about the scenes and events in view so as to 
arrive at the right answers, comparing and contrasting possi- 
pilities across many events, properties, discourse settings, and 
so forth. In other words, testing and manipulating an exceed- 
ingly broad and free-ranging hypothesis space. 


A very few investigators have been resgonsive to the 
issues here. Pinker (1987), in a direct and .sseful discussion 
of the requirement to limit the space of observables that a 
learner will consider in matching the event to the unknown verb, 


writes as follows: 


Verbs! definitions are organized around a4 surprisingly 
small number of elements: "The Main Event", that is, 
a state or motion; the path, direction, or location of 
an object, either literal spatial location or some 
analogue of it in a nonspatial semantic field; 
causation; manner; a restricted set of the properties 
of a theme or actor; temporal distribution (aspect 
and phase); purpose; coreferentiality of participants 
in an event; truth value (polarity and factivity); and 


a handful of others. 
(1887, p- 54) 


It is an open question whether Pinker's proposed list is 
narrow enough to meet the requirement for a realistic set of 
primitives upon which a verb-learning procedure can operate. 
Are purposes, truth values, causes, not to speak of ‘analogues 
of spatial location in nonspatial semantic fields" really 
primitives that inhere in the observations themselves? It 
seems to me highly unlikely that any choice of 
constraints will be restrictive enough to delimit the analyses a 
child performs in reaction to each event/ verb pair. Of course 
I'm not suggesting that there aren't principles of perception 
that are restrictive and highly structured (God forbid!). But 
they are likely not restrictive enough to account for vocabulary 
acquisition. How could they be? Perception has to be rich 
enough to keep the babies from falliny off cliffs and mistaking 
distant tigers for nearby pussycats lest they all disappear from 
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tne face of the earth before learning the verb meanings. 


However, the richness of perception is not the only, or 
even the major, problem faced by a hypothetical learner who 
tries to acquire verb-meanings from observation. The more dif- 
ficult problem is that even the homeliest and simplest verbs, 
though they refer to events perceivable, encode also the unob- 
servable present interests, purposes, beliefs, and perspectives 
of the speaker. I turn now to this class of problems. 


Perspectives on events 


Consider the learning of simple motion verbs, such as push 
or move, In a satisfying proportion of the times that care- 
caretakers say something like “George pushes the truck," George 
can be observed to be pushing the truck. But unless George is a 
hopeless incompetent, every time he pushes the truck, the truck 
will move. So a verb used by the caretaker to describe this 
event may represent one of these ideas ('push') or the other 


('move'). 


Moreover, every real event of the pushy sort necessarily 
includes, in addition to the thrust and goal, various values of 
trajectory, rate, and so forth, so that such ideas as !slide,' 
' ‘os ,' ‘srawl,' and so on, are also relevant inter- 
pretations of a new verb then uttered. What is left open by 
the observation is whether that verb represents any or all of 
these manner differences: no, in the case of push, but yes in 


the case of rol] or rumble. 


Note that the manner elements just mentioned do fall within 
the range encoded by verbs in many languages (Talmy, 1985) and 
are on the narrowed list of perceptual properties suggested by 
Pinker (1987). I leave aside various other interpretations 
often called "less salient" (i.e., I ignere more genera) 
consideration of the "stimulus-free" character of language use; 
see Chomsky, 1959), especially the countless zany inte:preta- 
tions of this event that could be drawn by worried philoso- 
phers. 


4 Jerry Fodor has suggested to me, maybe seriously, that 
these problems go away because the caretaker and child are in 
cahoots, and they are mind-resders. They are so attuned in 
discourse, »eing creatures of exactly the same sort, that the 
child zz,3 onto exactly the characteristics of the situation 
that the mother, just then, has in mind to express (see Bruner, 
1974/5 for a story about how the attentional conspiracy is set 
up by mother and child, and Slobin, 1977, for a related account 
of the conversational environment). A related position is 
maintained by Pinker (citing Keenan) about situations the 
learner might select as learning opportunities; in the case 
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It is plausible that these ambiguities are eliminated by 
looking at a verb‘'s uses across situations (see again Pinker, 
1987). There will eventually be some instance of moving called 

tn which the truck is moving rapidly, eliminating ‘crawl' 


/ 
as a conjecture about the meaning of this item, etc. By a pro- 


cess of cross comparison and elimination, each verb may eventu- 
ually be distinguishan.e. The worry is only that the burden of 
hypothesis testing is becoming ominous as the comparison set (of 
verbs, properties, and scenes) enlarges. 


Difficult problems can be solved. But impossible ones are 
harder. Consider such verbs as {flee ana chase, buy and sell, 
win and beat, give and get, and so on. Such pairs are common in 
the design of verb lexicons. Each pair alludes to a single kind 
of event: Whenever the hounds are chasing “he fox, the fox is 
fleeing from the hounds. If some hounds are racing, even with 
evil intentions, toward a brave fox who holds its ground, they 
can't be said to be chasing him. Chasing implies fleeing, 
necessarily. If the child selects a verb from the stream of 
speech accompanying such a scene, how then is she to decide 


2 


whether it means 'chase' 5r ' fleet? 


— 


Pinker is discussing, the child is tc discover the property 
supiect from its semantic/pragmatic envizonmental correlates: 


The semantic properties of subject hold only in basic 
sentences: roughly, those that are simple, active, 
affirmative, declarative, pragmatically neutral and 
minipally presuppesitional...The parents...or the child 
might filter out nonbasic sentences from the input using 
various contextual or phonological diagnostics of nonbasic- 
ness such as special intonation, extra marking on the verb, 
presuppositions set up by the preceding disceurse or the 
context, nonlinguistic signals of the interrogative or 
negative illocutionary force of an utterance, and so on. 
(Pinker, 1984, pp 46/7)- 


Note again the number and nontransparency of the experiential 
analyses necessary within this perspective. 


SI may well be granting too much here. After all, 
touching, and even breathing and existing are going on in the 
presence of all moving and pushing events. So it's probably not 
true that a unique interpretation of verbs from scenes can ever 
be extracted, whatever the ornateness of the scene-storage and 
manipulation procedures may be. Not at least withuut invoking 
notions of "salience" which is likely just substitution of 
unknowns for unknowns. 
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Such examples are thrusts to the heart of the observational 
learning, hypothesis. As Pinker (1987, p. 54) acknowledges, 
"Basically, we need to show that the child is capable of enter- 
taining as a hypothesis any possible verb meaning, and that he 
or she is capable to eliminating any incorrect hypothesis as a 
result of observing how the verb is used across situations." 
But chase and flee (and a host of similar pairs) are relevantly 
used in all and only the same situations. It follows that it 

be shown that the child is capable of eliminating the 
incorrect hypotheses by cross-situational observation. 


I think the problem is that words don't aescribe events 

. If that's all words did, we wouldn't have to talk. 

We could just point to what's happening, grunting all the while. 

But instead, or in addition, the verbs seem to describe specific 

perspectives taken on those events by the speaker, perspectives 

which are not "in the events" in any direct way. How far are 

we to give the learner leave to divine the intents of his elders 

as to these perspectives? Are they talking of hounds acting 
with respect to foxes, or of foxes with respect to hounds? 


Speaking more generally, since verbs represent not only 
events but the intents, beliefs, and perspectives of the speak- 
ers on those events, the meanings of the verbs can't be ex- 
tracted solely by observing the events. 


The subset problem 


A related problem has to do with the level of specifi- 
city at which the speaker, by the words he chooses, refers to 
the world. Consider the homely little objects in the world, 
the pencils, the ducks, the spoons. All these objects are 
supplied with more than one name in a language, e.g., animal. 

é I expect that the adult speaker has little 
difficulty in selecting the level of specificity he or she wants 
to convey and so can choose the correct lexical item to utter in 
each case. And indeed, the learner may be richly pre-equipped 
perceptually and conceptually so as to be able to interpret 
scenes at these various levels of abstraction, and to construct 
conceptual taxonomies (Keil, 1979). But as usual this very 
latitude adds to the mystery of vocabulary acquisition, for how 
is the child to know the level encoded by the as yet unknown 
word? The scene is always the same if the child conjectures the 
more inclusive interpretation (that is, if her first conjecture 
is animal rather than duck). For every time there is an obser- 
vation that satisfies the conditions (whatever these are) for 
the appropriate use of duck, the conditions for the appropriate 
use of animal have been satisfied as well. 


Analogous cases exist in the realm of verb meanings. To 
return to the instance dramatized by the blind learners, Per- 


ceive, see, look, eye (in the sense of ‘set eyes on‘), face, 


S) 
ERIC 


14 


, pose the same subset problem. There is no seeing with- 


out looking, looking without facing, facing without orienting, 


etc. All this suggests that not only blind childcren, but 
sighted children as well, should have (essentially the same) 
difficulties in learning the meanings ef look and see, because 
the distinction between the two words is not an observable 
property of the situations in which they are used. Yet, as I 
@iscussea earlier, it is just these “unobservable” properties 
that the blind and sighted three year olds held in common. 


Gold (1967) addressed a problem that seems related to this 
one. He showed formally that learners whe had to choose be- 
tween two languages, one of which was a subset of the other, 
could receive no positive evidence that they had chosen wrong if 
they happened to conjecture the superset (larger) language. This 
is because the sentences they would hear, all drawn from the 
subset, are all members of the superset as well. It has there- 
fore been proposed that learners always hypothesize the smaller 
(subset) language; they initially select the most restrictive 
value of a parameter on which languages vary (Berwick, 1981; 
Wexler and Manzini, 1987). 


But the facts about the lexicon do not allow us to suppose 
that the child has a solution so simple as choosing the least 
inclusive possibility. In the end, they acquire all of then. 
Moreover, neither the most inclusive nor the least inclusive 
possibilities seem to be the initial conjectures; rather, some 
‘middle" or "basic" level of interpretation is the one initially 


selected, i.e., duck and look (as opposed to mallard and 
seem to be the real first choices of the learners. 


In short, words that stand in a subset relation pose an 
intractable problem for an unaided observation-based learning 
procedure. This is because the child who first conjectures the 
more inclusive interpretation can receive no positive evidence 
from word-to-world mappings that can dissuade him. And the 
idea that he always begins with the least inclusive interpreta- 
tion consistent with the data is falsified by the empirical 
facts. 


But the verbs that most seriously challenge the semantic 
-pootstrapping proposal still remain to be discussed: These are 


6 these results can't be written off on grounds of the 
differential frequency of these words in the input corpus, for 
if the frequencies are changed the level of categorization does 
not. For instance, in some houses Fide is a more frequent word 
than dog, but in that case the youngest children think that the 
word meaning 'dog' is /faydo/ (Rescorla 1980). 
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tne ones that don't refer to the observable world at ail. 


Locke noted that the meanings of many words involve proper- 
ties that are closed to observution, but he did not consider 
this fact to be fatal to his overall position because his 
experience, partly warranted, was that those who used such 
Nabstract™ words didn't know what they were talking about half 
the time anyhow. Neverthelese a key problem for an unaided 
observational-learning story is ‘that too many words that even a 
three or four-year old understands are related to the real world 
only in the most obscure and unobservable ways, if at all. Try, 
for example, to learn the meaning of the word think by titrating 
discourse situations into those in which thinking is going on, 
somewhere, when you hear /think/, vs those in which no thinking 
is happening. Remember that there isn't always brow-furrowing 
or a Rodin statue around to help. Keep in mind also that you 
are going to have to distinguish also among think, quess, won- 

and understand, not to speak of -- a 
few months or years later -- 


discover, perceive, etc. 


Many developmental psycholinguists rule such instances out 
of school on the grounds that these aren't words that children 
know very well at two and three years old, but this won't do. 
After all, we also want to understand the children who manage to 
survive to become the four and five year clds. 


I don't really think this topic needs much more belaboring. 
If the child is to learn the meanings from perceptual discrimi- 
nanda in the real world, the primitive vocabulary of infant 
perception has to be pretty narrow to bring the number and var- 
iety of data storing and manipulative procedures under control. 
But no such narrow vocabulary of perception could possibly 


select the thinkingness properties from events. I conclude that 
an unaided observation-based verb learning theory is untenable 
because it could not acquire think. 

Summary 


I've mentioned a number of problems for a theory that 
(solely or even primarily) performs a word-to-world mapping to 
solve the vocabulary learning task. These are that (1) such a 
theory fails to account for the fact that children whose 
exposure conditions are radicaily different acquire much the 
same representations of many words; (ii) plausible, narrowly 
drawn, candidates for event representation seem to be inadequate 
in accounting for the learning in certain apparently easy cases; 
(iii) broadening the hypothesis space so as to allow learners to 
distinguish among the many verb meanings mcy impose unrealistic 
storage, manipulation, and induction demands on the mere babes 
who must do the learning. In addition, (iv) many verbs are 
identical in all respects except the perspectives that they 
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adopt toward events or (v) the level of specificity at which 
they describe a single event; or (vi) don't refer to events and 
states that are observable at all. Since children learn the 
verb meanings despite these apparently formidible problems, ny 
conjecture is that they have another source of information that 
redresses some of the insufficiencies of observation. 


Part II: New approaches for vocabulary acquisition 


I return now to the problem Landau and I faced in under- 
standing the blind child's semantic achievements. Keep in mind 
that the analysis of Table 1 was an attempt to explain only the 
most straightforward, perceptually relevant, aspect of her 
acquisition of look and see, namely that if these verbs had to 
do with haptic perception, there must have been pertinent 
objects close to her hands when her mother said those words. 
Yet even this simple idea seemed to be falsified by our 
analysis. 


To find out why, our first step was to return to the data 
of Table 1 to see where and when the NEARNESS constraint had 
failed for so many uses of look and see. We found that the 
sentences that fell neatly under the object-nearby conjecture 
were very simple ones: If the mother had said something like 


Look at this boot! 
or See the apple? 


invariably the boot or apple were NEAR, within the blind child's 
reach. But if the mother said 


: r ' (while dialing the phone) 
Leok what you're doing! 
You look like a kangeroo in those overalls. 
or Let's go see Poppy. 


the "pertinent object" was likely to be FAR or there was NO such 
pertinent OBJECT intended. Clearly, the sentences that tripred 
up our simple story were queer ones indeed. The mother didn't 
seem in most of these cases to mean ‘examine or apprehend' 
either haptically or visually, but rather ‘determine’, ‘watch~ 
out', or ‘resemble.' Or else, as in the final example, 4 
motion auxiliary (qo) in the sentence transparently took off the 
NEARbyness requirement. 


There are two ways to go now: One can claim that the 
NEARbyness environmental clue to the haptic interpretation was 
just a snare and delusion -- but that is ridiculous. It just 
HAS to be right that this aspect of the environment was part of 
what licensed the child's haptic interpretation. The other 
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choice is to find some non-question-begging way through which 
the child could have gotten rid of the sentences that otherwise 
would threaten the experiential conjecture. (The question- 
begging way, of course, is to say that the mother didn't mean 
thaptically explore’ in the offending sentences). 


How can this be done? The clue is that not only the 
meaning, but the syntax too, of these offending sentences is 
special -- different from the syntax of sentences in which the 
child was really being told to explore and perceive nearby 
objects. This syntactic distinction may be available to the 


learner. 


A syntactic partitioning of the verbs commonly used by the 
mother of the blind baby (based on the same corpus analyzed in 
Table 1), according to the subcategorization frames in which 
each verb appeared in the maternal corpus, is shown in Table 2; 
the verbs of Table 1 appear as the columns in this table, and 
the syntactic environments appear as the rows; the numbers in 
each cell are the number of instances of a verb in some parti- 
cular syntactic environment. ’ Notice first that some of the 
typical syntactic environments for look and see are quite 
different from those for the other verbs in the set. 


Moreover, we can -~- with only a little fudging -- divide 
the environments of the vision-related verbs so as to pull apart 
those environments in which the NEARbyness contextual cue holds, 
and those in which it does not: That analysis is shown in Table 
3. Essentially, the top rows of Table 3 show the maternal uses 
of these verbs in their canonical subcategorization frames 
(e.g., "Look at/see the frog," "Look up/down") and the deictic 
interjective uses that are the most frequent in that corpus 
(e.g., "Look!, That's a frog!" and “See?, That's a frog!"). 
When these syntactic types only are considered, the NEAR 
proportion of logk rises (to 100%, from 73% in Table 1) and so 
does the NEAR proportion of see (to 72% from 39%). Thus if 
the learner can aiid does perform these analyses, the first 
result is that NEARbyness of the pertinent object becomes a much 
more reliable real-world clue than previously. But notice that 
the hypothesis now is that the child performs a sentence~to- 
world mapping, rather than the word-to-world mapping shown in 
Table 1: The child's interpretation of extralinguistic events 
has been significantly modulated by her attention to 
events, namely the syntax. 


Landau and I made yet another, and much stronger, claim 
based on the kinds of outcomes shown in Table 2. This was that 


7 Syecifically, the rows of this table represent sub- 
categorization frames, the sister-nodes to V under the verb 


phrase. 


ro 
cat 


Table 2: 
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Oe 
Group I Group II Group It 
Look’ See* Give’ Put’ Hold’ Play Get’ Have Go* Come’ 


Exclude look/ser 
Vv NP PP Ss su i 2 
V NPD 28 6 2 


Vv NP NP 16 2 


2 5 10 13 


Ld 
a 
a 


Vv NP 3 3 3 13 +44 
V AP 2 3 
Totals M 1% © 6t 61 10 10 327 #158 20 19 
a. Verbs that occur with locative prepositions and adverbs. 
b. A causative use of have: “Will we have Barbara come baby siti 
c. Play with the nonlocative (reciprocal) preposition wrth “You're not gonna play 
with the triangle, so forget it!” 


Subcategorization privileges of the common verbs used 
by the mother of a bi’-d child during the learning 
period. The number i. each cell represents the number 
of times that a verb is used in a particular frame 
environment (from Landau and Gleitman, 1985) 
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the range of subcategorization frames has considerable potential 
for partitioning the verb set semantically, and that language 
learners have the capacity and inclination to recruit this 
information source to redress the insufficiencies of raw 
observation. This examination of structure as a basis for 
deducing the meaning is the procedure we've called "syntactic 
bootstrapping. * I turn now to a comparison of the hypothesis 
called “semantic bootstrapping" by Pinker to the one called 
"syntactic bootstrapping" by Landau and ne. 


The bootstrapping proposals compared 


The two bootstrapping proposals are much alike in what they 
claim about correspondances between syntax and semantics, and 
are also alike in proposing that the child makes significant use 
of these correspondances. First I'll sketch, very briefly and 
informally, the kinds of syntactic/semantic correspondances 
that are crucially invoked in both proposals. 


Syntactic/semantic linking rules: To an interesting 
degree,. the structures in which verbs appear are projections 
from their meanings. To take a simple example, the different 
number of noun-phrases required by the verbs lauqh, smack, and 
put in the sentences 


(1) Arnold laughs. 
(2) Arnold smacks Gloria. 
(3) Gloria puts Arnold in his place. 


is clearly no accident, but rather is semantically determined-- 
by how many participant entities, locations, etc., the predicate 
implicates. Similarly, the structural positions of these noun- 
phrases relative to the verb also carries semantic information; 
thus, much more often than not the subject noun-phrase will 
represent the actor or causal agent (e.g., Arnold in sentence 1 
and Gloria in sentence 2), and paths and goals will appear in 
prepositional phrases (in his place, in sentence 3). These 
links of syntactic position and marking to semantic properties, 
while by no means unexceptional, typify the ways that English 
represents semantic-relational structure. In short, verbs that 
are related in meaning share aspects of their clausal syntax. 
Zwicky (1971) put the idea this way: 


"Tf you invent a verb, say greem, which refers to an act 
of communication by speech and describes the physical charac- 
teristics of the act (say a loud, hoarse, quality), then you 
know that...it will be possible to greem (i.e. to speak loudly 
and hoarsely), to greem for someone to get you a glass of water, 
to greem at your sister about the price of doughnuts, to green 
"Ecch" at your enemies, to have your greem frighten the baby, to 
greem to me that my examples are absurd, and to give a greem 
when you see the explanation." 
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syntax: As I mentioned earlier, both the bootstrapping propo- 


Sals make cricical use of these canonical relations between 
syntax and semantics. In the semantic bootstrapping procedure, 
the chila fixes the meaning of a verb by observing its real- 
world contingencies. In Pinker's (1987) words: 


" . the child could learn verb meanings by (a) sampling, on 
each occasion in which a verb is used, 4a subset of the 
features...8, (b) adding to the tentative definition for 
the verb its current value for that feature and (c) 
permanently discarding any feature value that is con- 
tradicted by a current situation." 


I have argued at length that this position is too strong, for at 
least some features are unobservable. Yet no one can doubt 
that, at least sometimes, the context of use ‘is so rich and 
restrictive as to make_a certain conjecture about interpretation 
overwhelmingly likely.? 


Once the verb meaning has been extracted from observation, 
the semantic bootstrapping hypothesis invokes the linking rules 
(the canonical syntactic/semantic mappings) to explain how the 
child discovers the structures which are licensed for the use of 
these verbs, much in the spirit of Zwicky's comments about the 
invented word qreem. For instance, if a verb has been disco- 
vered to mean give, then it will appear in three-argument 
structures such as iv . This is because 
the logic of 'give’ implies one who gives, one who is given, and 
that which is given, and each of these entities requires a noun- 


8 The features are those mentioned in my earlier citation 
of Pinker (page lO of this manuscript). 


9 At peril of making one argument too many, however, I 
can't resist complaining that Pinker's proposed procedure is 
too extreme. After all, sometimes the child is attending to 
one thing (say, the dog under the table) when the mother says 
something irrelevant to that (say, "Eat your peas, dear!"). So 
the learner better not "discard permanently" any feature that 
contradicts the current situation co 
j In fact, positive imperatives pose one of the most 
devastating challenges to any scheme that makes word-to-world 
pairings for the mother will utter "Eat your peas!" if and only 
i£ the child is not then eating his peas. Thus a whole class of 
constructions seems to be reserved for saying things that 
mismatch the current situation. 
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phrase to express. 


Not only is this position plausible. There is much evi- 
dence in its favor. Notably, Bowerman (1976; 1982) showed that 
children will make just such predictions about the syntactic 
structures licensed for verbs, presumably based on their prior 
fixing of the verb meanings: That evidence came from instances 
where children's conjectures were evidently too bold or insuf~- 
ficientiy differentiated; that is, where they were wrong -~- but 
still understandable. For instance, a subject of Bowerman's 
commanded "Don't eat the baby ~-- she's dirty!" on an occasion 
when the mother was about to feed the baby (whose diaper needed 
changing). Presumably, the child had conjectured that an in- 
transitive motion verb (e.g., sink, as in The ship sank) could 
be uttered in a transitive structure (such as 
the ship) to express the causal agent of this motion. 


To summarize, the semantic bootstrapping procedure as 
developed by Grimshaw (1981), Pinker (1984) and others, works 
something like this: The child is conceived as listening to the 
words used, and then trying to figure out their meanings by 
observing their situational concomitants, the word~to-world 
pairing that I've discussed. Quoting Pinker (1984) again, 


If the child deduces the meanings of as wet uncomprehended 
input sentences from their contexts and from the meanings 
of their individual words, he or she would have to havve 
learned those word meanings beforehand. This could be 
accomplished by attending to single words used in isola- 
tion, to emphatically stressed single words, or to the 
single uncomprehended word in a sentence...and pairing it 
with a predicate corresponding to an entity or relation 
that is singled out ostensively, one that is salient in the 
discourse context, or one that appears to be expressed in 
the speech act for which there is no known word in the 
sentence expressing it (p. 30). 


Once the meanings have been derived from observation, the child 
can project the structures from her (innate) knowledge of the 
rules that map semantic structures onto syntactic structures 
(by procedures variously called mapp 

projection rules, or ). Perhaps so, 
but I have been arguing that entities and relations cannot in 
general be singled out ostensively, that "salience" and the 
question of what's "expressed in the speech act" are not so 
easily recoverable as this perspective must insist. For such 
reasons, Landau and I developed a procedure that looks quite 
different from this. 


ct cots : The syntactic bootstrapping 
proposal in essence turns semantic bootstrapping on its head. 
According to this hypothesis, the child who understands the 
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mapping rules for semantics onto syntax can use the observed 
syntactic structures as evidence for deducing the meanings. 
The child is conceived as having certain concepts in mind, say, 
‘Jook' or 'put', and is engaged in a search for the words that 
express these concepts. To accomplish these aims, the child 
observes the real-world situation but also observes the 
structures in which various words appear in the speech of the 
caretakers. That is to say, the child performs a sentence-to- 
world pairing rather than a word-to-world mapping. Such a 
procedure can succeed because, if the syntactic structures are 
truly correlated with the meanings, the range ef structures will 
be informative for deducing which word (qua phonological object) 
goes with which concept. Such a procedure will be quite handy 
if, as I have argued, raw word-to-world mapping cannot succeed. 
The difference between semantic bootstrapping and syntactic 
bootstrapping, then, is that the former procedure deduces the 
structures from the word meanings that are antecedently ac- 
quired from r>l-world observation; while the latter procedure 
deduces the word ~xeanings from the semantically relevant syntac- 
tic structures as ciated with a verb in input utterances. 


Let us take the simple examples of put, look. and see., 
which occurred in the corpus provided by the blind child's 
mother. verbs that describe externally caused transfer or 
change of possessor of an object from place to place (or from 


person to person) fit naturally into sentences with three noun-. 


phrases, e.g. John put the ball on the table. This is just the 
kind of transparent syntax/semantic relation that every known 
language seems to embody and therefore may not be too wild to 
conjecture as part of the original presuppositional structure 
that children bring into the language learning task (Jackendoff, 
1978: 1983; Talmy, 1975; Pinker, 1984). That is, ‘puttina' 
legically implies one who puts, a thing put, and a place into 
which it is put; a noun-phrase is assigned to each of the 
participants in such an event. In contrast, since one can't move 
objects from place to place by the perceptual act of looking at 
them, the occasion for using leek in such a structure hardly, if 
ever, arises (John looked the ball on the table sounds un- 
natural). Hence the chances that /put/ means 'put' are raised 
and the chances that /put/ means 'look' are lowered by the fact 
that the former and not the latter verb appears in thpee=noun- 
phrase constructions in caretaker speech (see Table 2). 


external agent can cause an object to move by looking at it, 
@.g-, =a". 1S i> ot he 2& é J ch oS o In 
effect, once jlook does mean cause-to-move-by-perceptualiy- 
exploring, it becomes comfortable in this construction. of 
course these simpl2 examples vastly underestimate the detail 
required if such a theory is to become viable. One such 
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Near Far No object “Near” proportion 


Canonical sentence 


frames and deictic uses 
Look at NP 3 0 0 
Look D 2 re) 0 1.00 
Look! 8 is] a] 
Look! this is NP 10 0 0 
See NP 1 2 0 
See? 1 o 0 Je 
See?, This is NP 3 re) 0 
With motion auxiliaries 
Come see NP 0 3 0 co 
Other environments 
Look AP ) 1 1 
Look like NP 0 fe) 5 18 
Look ¢ 2 0 ra) 
See S 2 3 0 25 
See # re) 2 1 
Total {all environments) 
Look 25 3 6 73 
See 7 10 1 39 


Table 3: Situational contexts for the common verbs used by the 
blind child's mother, organized according to the 
syntactic (subcategorization frame) contexts (from 
Landau and Gleitman, 1985) 
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Verbs of perception and cognition are associated with some 
other constructions, as they should be. For example, if a verb 
is to mean 'see' (perceive perceptually), it should appear with 
noun-phrase objects as in John sav 8 mouse, for noun-phrases are 
the categories that languages select to describe such entities 
as mice. But since events as well as entities can be per- 
ceived, this verb should also appear with sentence complenents, 
since clauses are the categories selected by languages for 
expressing whole events (@.g., 's s there's ch 

bs The possibility that /gee/ means 'see' is 
increased by appearance in this construction, and the likelihood 
that /put/ means 'see' is decreased by the fact that one hardly, 
if evcr, hears iet's put if there's cheese in the refrigerator: 
see again Tabl 2). 


Speaking more generally, certain abstract semantic ele- 
ments such as ‘cause,' 'transfer,' and ‘cognition’ are carried 
on clause structures (subcategorization frames) rather than (or 
in addition to) as item-specific information in the lexical 
entries. of verbs. These semantically relevant clause struc 
tures will be chosen Zor utterance only to the extent that they 
fit with the overall meanings of the verb items. It follows 
that the subcategorization frames, if their semantic values are 
known, can convey important semantic information to the verb 
learner. To be sure, the number of such clause structures is 
quite small compared to the number of possible verb meanings: 
It is reasonable to assume that only a limited number of highly 
general semantic categories and functions are exhibited in the 
organization that yields the subcategorization frame distinc~ 
tions. But each verb is associated with several of these 
structures. Each such structure narrows down the choice of 
interpretations for the verb. Thus these limited parameters of 
structural variation, operating jointly, | can predict the 
possible meaning of an individual verb quite closely. Landau 
and Gleitman showed that the child's situation”: and syntactic 
input, as represented in Tables 2 and 3, were sufficient in 
principle to distinguish among ail the verbs commonly used in 
the maternal sample for the blind child. This general outcome 
is schematized in Figure 3. 


The potential virtues of this syntactically informed verb- 


problem is that the child must impose the proper parse on the 
sentence heard, lest b table be taken as 
a counter-example (that is, the analysis is to be of sister- 
nodes under VP only, and a theory of how the child determines 
such configurations antecedently is a requirement of the 


position). Another real difficulty is that the child might run 
John saw his brother 


into one of many quirky constructions like Jo 
out of the room, looked his uncle in the eye, etc. 
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motion nonmotion 
(locative P’s and D’s) (no locative P’s and D's) 
alienable inalienable play 
(3 arguments) (2 arguments) have 
“near” “far” physical ental 
(no S-complement) (S-complements) 
(‘far’) (‘near’} 
give get '  {(Deictics) 
hold . 
put come 
go 
Active : Stative 
{inter}. commands) {inter}, queries) 
look see 


Figure 3: Summary of the situational and syntactic distinctions 
among verbs commonly used by the mother to the blind 
child during the learning period. (from Landau and 
Gleitman, 1985) 
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learning procedure are considerable. First, it serves the 
local purpose of offering 4 non-maygical explanation for the 
blind child's acquisition of visual terms, as just described. 
Second, it points the way toward acquisition of terms when 
observation fails. This is because, for example, mental verbs 
such as think are unambiguously marked by the syntax (by taking 
sentence complements) even though their instances cannot be 
readily observed in the world. Third, it gives the child a way 
of learning from a very small database. This is because the 
number of subcategorization frames associated with each verb is 
smali (on the order of 10 - 20), and these are the data 
requirements for the procedure to work. Fourth, that database 
is categorical rather than probabalistic: Though verb uses to 
the child are often pertinent to what is going on in the here- 
and-now, sometimes they are not (e.g., the mother may speak of 
running to the store while she sits in her parlor). In con- 
trast, mothers virtually never speak ungrammatically to their 
children -- that is, use verbs in nonlicensed syntactic 
environments (Newport, 1977). Thus the child can take one or 
two instances of a verb in some frame as conclusive evidence 
that it is licensed in this environment. Finally, what is used 
in this procedure for learning is part of what must be known by 
an accomplished speaker: Knowing the subcategorization privi- 
leges for each verb is part of what it means to be an English 
speaker. In contrast, many of the situational analyses 
constructed along the way by the semantic bootstrapper will not 
figure in the final definition of a verb. 


In the light of all these virtues, it would be nice if this 
theory turned out to be part of the truth about how the verb 
vocabulary is acquired. I will provide some empirical evidence 
in its favor below. But first some presuppositions of the 
position have to be defended before so apparently “abstract” a 
procediu.re can be considered viable at all. I turn now to such 
questions. 


omen boot otheses 


The bootstrapping hypotheses involve a number of presup~ 
positions that require demonstration in their own right, lest 
all learning questions be begged. In company with all known 
theories of word learning, they presuppose that the human child, 
by natural disposition (or learning during the prelinguistic 
period) is able to conceive of such notions as ‘running’! and 
‘looking! and implicitly understands that words make reference 
to such acts and events. Past this background supposition, both 
semantic and syntactic bootstrapping procedures -- but especial- 
ly the latter -- make very strong claims about the child's 
knowledge as verb learning begins. YI will now go through these 
claims, mentioning some of the experimental evidence that gives 
them plausibility. 
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syntactic structures associated with verbs are uncorrelated with 
-~- OF hardly correlated with -- their meanings, then the child 
can't learn much about the meanings by observing the structures. 
No one doubts the sheer existence of such form/meaning regulari- 
ties owing to the results achieved by a generation of linguists, 
notably Gruber, Fillmore, Vendler, Jackendoff, and Levin (and 
many others), but questions can be raised about the stability, 
degree, and scope of these relations. That is, how far can a 
syntactic analysis such as that in Table 2 succeed in partition- 
ing the lexicon semantically for the child learner? 


I'll mention one line of investigation of these questions 
from our laboratory. Fisher, Gleitman, and Gleitman (in press) 
reasoned as follows: If similarity in the range of subcategori- 
zation frames of verbs is correlated with similarities in their 
meanings, then subjects asked to partition a set of verbs (a) 
according to their meanings and (b) according to their licensed 
structures should partition the verbs in much the same ways. 
To test this idea, one group of subjects made judgments of mean- 
ing-similarity for triads of verbs presented to them. Specifi- 
cally, they chose the semantic outlier in each triad (e.g., 
shown eat. Grink, and sing, they choose sing as the outlier, but 
shown eat, drink, and quaff they might choose eat). A semantic 
space for a set of verbs was derived from these data by tabulat-~- 
ing how often two verbs stayed together (were not chosen as 
outlier) in the context of all other verbs with which they were 
compared. Presumably, the more often they stayed together, the 
more semantically similar they were. A second group of sub- 
jects gave judgments of grammaticality for all these same verbs 
in a large number of subcategorization frames. A syntactic 
space was derived in terms of the frame overlap among them. The 
similarity in the syntactic and semantic spaces provided by 
these two groups of subjects was then compared -statistically. 


The finding was that the frame overlap among the verbs is a 
very powerful predictor of the semantic partitioning. In short, 
verbs that behaved alike syntactically were, to a very interest- 
ing degree, the verbs that behaved alike semantically. Such 
results begin to show that a syntactic partitioning of the input 
can provide important evidence for a learner who is disposed to 
use such information -- as was conjectured for the blind child, 
see Figure 3. 


e semantic/syntact ep y- isp | ne same ess 

? The first proviso to the conclusion just drawn, 
for learning questions, is that the semantic-syntactic relations 
have to be about the same across languages. Otherwise, depend- 
ing on the exposure language, different children would have to 
perform different syntactic analyses to derive aspects of the 


meaning. And that, surely, begs the questions at issue. 
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Recent theorizing in linguistics does support the idea that 
there are semantic/syntactic linkages that hold across lang- 
uages. In a recent version of generative grammar (Government/- 
Binding theory; see Chomsky, 1981), some of these relationships 
are stated as universal principles of language design. One 
example is the mapping of entities implied by the verb logic 
one-to-one onto noun-phrase positions in the clause: Every NP 
in a sentence must receive one and only one thematic role (the 

- “= ie Moreover, a related principle (the 

_prine ) states that the theta-criterion will hold 
at every level of a derivation; in particular, that argument 
structure is preserved on the surface clause structures. This 
is just the organization required by a bootstrapper -- semantic 
or syntactic. 


Talmy (1975; 1985) has investigated a number of typologi- 
cally quite different languages and found a variety of striking 
similar:ties in how their semantics maps onto the syntax. For 
those who prefer experimental evidence fron linguistically naive 
subjects, Fisher et al, in a very preliminary cross-linguistic 
foray with their method, showed that the relationship between 
being a verb of cognition and accepting sentence complements is 
as strong and stable in Italian as in English. 


The two relationships just mentioned (that a NP is assigned 
to each participant in the event, and that verbs encoding the 
relation between an agent and a propositon accept sentence 
complements) are not only true cross-linguistically. They have 
a kind of cognitive transparency that makes them plausible as 
part of the presuppositional structure children might really 
bring into the language learning situation. As Jackendoff puts 
this point: 


In order to lighten the language learner's load further, it 
seems promising to seek a theory of semantics (that is, of 
conceptualization) in which the projection rules are 
relatively simple, for then the child can draw relatively 
straightforward connections between the language he hears 
and his conception of the world. The methodological 
assumptions for such a theory would be that syntactic 
simplicity ideally corresponds to conceptual simplicity; 
grammatical parallelisms may be cluses to perceptual 
parallelisms; apparent grammatical constraints may reflect 
conceptual constraints. 
(1978; p. 203) 


From these and related arguments and demonstrations, I think the 
plausibilty of the bootstrapping theories receives at least some 
initial defense. 


Can the learner analyze the sound wave in a way that will 
36 
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Vv 2 There is a timing 
difference in the requirements of semantic and syntactic 
bootstrapping approaches: For the latter approach, the learner 
has to be able to parse the sentences that she hears in order to 
derive a syntactic analysis; moreover, at least some of the 
mapping rules have to be in place before the verb meanings ars 
known and thus the whole game is over. There is strong evidence 
supporting both these claims: 


ts pa : Once upon a time, not so very 
long ago, it was believed that babies could divide up the sound 
wave into words but not into phrases. This perspective neces- 
sitated complex theories for how learners could derive phrasal 
categories from the initial word-like representations (Wexler 
and Culicover, 1980; Pinker, 1984). In retrospect, these ideas 
were somewhat improbable. For one thing, there is evidence that 
infants are sensitive to such physical properties of the wave 
form as change in fundamental frequency, silent intervals, and 
syllabic length, all of which are universal markers of phrase 
boundaries (see, e.g., Fernald, 1984). As Gleitman and Wanner 
(1982) pointed out, the physical correlates of word segmentation 
are far more subtle and less reliable. More generally, our 
reading of the cross-linguistic facts about language learning 
led us to propose that the infant's analysis of th wave form 
was as a rudimentary phrase-structure tree.+1 In a similar 
vein, Morgan and Newport (1981; Morgan, Meier, and Newport, 
1988, showed in a series of artificial language-learning 
experiments that adults could learn phrase structure grammars it 
provided with phrase-bracketing information but not if provided 
only with word-level information. This finding led these 
investigators independently to the same proposal about the 
child's initial representation of the input wave forms. 
Recently, Hirsh-Pasek and her colleagues (1988a) have shown that 
prelinguistic infants listen to maternal speech doctored so as 
to preserve phrase- and clause-bounding information in prefere- 
nce to speech doctored so as to remove or becloud this informa- 
tion (see Gleitman et al, 1987, for a review of the evidence and 
its interpretation for a language acquisition theory). 


11 Notoriously, word-segmentation in a language like 
English is so fraught with ambiguity that new pronunciations 
(e.g., mother and apron replacing other and napron) are quite 
common. Moreover, there are long-lasting errors by children, 
e.g., one six-year old wrote "The teacher said, Class be 
smissed!" The phrasal parses suggested by Gleitman and Wanner 
were "rudimentary" to the extent that the unstressed elements in 
the phrases were presumed to be less well analyzed than the 
stressed elements, and the phrases were unlabelled (but see 
Joshi and Levy, 1982, for evidence that much of labelling, or 
its equivalent, can be derived from "skeletal" representations 
in which there are configurations but no overt labels). 
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The evidence just cited is not precise enough to give a 
detailed picture of the infant's phrasal parse!* However, that 
evidence is strong enough to support the view that children, 
even in the prelinguistic period, impose an analysis on the wave 
form sufficient for partitioning it into phrases. There is 
weaker but still suggestive evidence that the young learners 
also have the wherewithal to label the phrases differentially 
(see again footnote il )- It is incontrovertible that the two 
and three year olds who are the real verb learners can achieve 
the analyses of input shown in Table 2, and which are a re- 
quirement for achieving the semantic partitioning of the verb 
set shown in Figure 3. 


respondance rules? A crucial further requirement for the 
bootstrapping hypotheses is that the child understand the 
semantic values of the subcategorization frames. A child who 
recovers the meaning from observation, and who is to deduce the 
structures therefrom, has to know what the semantics of the verp 
implies about the syntactic structures licensed. And a child 
who recovers the srntactic structures licensed for verbs from 
the linguistic contexts in which she hears them has to know what 
semantic elements are implied by participation in these struc- 


tures. As Jackendoff emphasized, the burden of learning would. 


certainly be reduced for a child in possession of such informa- 
tion. But do real learners actually have it? There is striking 
evidence that they do. 


Golinkoff et al (1987) developed a very useful paradigm for 
studying very young children's comprehension. Essentially, 
they adapted a procedure designed by Spelke for studying infant 
perception. The set-up for the language case is shown in Figure 
4. The child sees different scenes displayed on two video 
screens, one to the child's left, one to her right. The scenes 
are accompanied by some speech stimulus. The mother wears & 
visor so that she cannot observe the videos and so cannot give 
hints to her child. Hidden observers are so positioned that 
they cannot observe the video, but they can observe which way 
the child is leoking, and for how long. It turns out that 
children look sooner and longer at the video that matches the 
speech input. 


In a first demonstration relevant to the syntactic boot- 
strapping hypothesis, Golinkoff et al showed that 19-month old 
children -- many of whom had never put two words together in an 
utterance, and knew few if any verbs ~~ understand some facts 
about the semantic values of English constructions. Two 


12 But see Eccles and Newport, forthcoming, for experimen- 
tal findings that support significant theorizing in this area. 
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simultaneous videos showed cartoon characters known to the 
children interacting. For some subjects, the stimulus sentence 
was st - For the cthers, it was 


co _M: ¢ . The children demonstrated 
by their selective looking that they knew which sentence des- 
cribed which observed event: They looked longer at the screen 
showing Big Bird tickling Cookie Monster when they heard the 
former sentence, and at the screen showing Cookie Monster 
tickling Big Bird when they heard the latter sentence. That is, 
these children recognize the order of phrases (or something 
approximating phrases) within the heard sentence and also 
understand the semantic significance of the ordering for the 
propositional interpretation of English speech (see also Slobin 
and Bever, 1982, for cross-linguistic evidence on this topic). 


I and my colleagues (Hirsh-Pasek et. al, 1988b) used this 
same procedure to investigate one more property of the mapping 
rules, namely the causative structure for which Bowerman (1982) 
had found many innovative uses by youngsters: Roughly, intran- 
sitive motion verbs (e.g., Big Bird tuxyns) can be "transitiviz~- 
ed" in English and then will express the causal agent as well 


(Cookie Monster turns Big Bird). 


To study this question using the procedure of selective 
looking, it is necessary that both entities appear in the 
stimulus sentence; otherwise the children may use the relatively 
trivial strategy of looking at the stimulus showing Big Bird if 
and only if Big Bird is mentioned. Hence the real stimuli used 
were, for example, Big Bird is turning Cookie Monster and Big 

° One video showed the two 
characters turning side by side, and the other video showed one 
character physically causing the other to turn. In addition to 
verbs like turn that (by maternal report) were probably known to 
the 2-year old subjects, unknown ones were also used. For ex- 
ample, the characters were shown flexing their arms, or one 
flexing the arms of the other, along with the stimuli Biq Bird 

efele) and 
Monster. We were unable to show stable effects of the 
syntactic structure for children at 24 months of age. But just 
about every youngster by 27 months showed the effect of the 
structure, by looking longest at the syntactically congruent 


screen. 


The conclusions to be drawn are very important ones for the 
syntactic bootstrapping hypothesis. The paired actions are the 
same, e.g., both are of turning in a circle, or both are of 
flexing the arms. What differs is whether a causal agent of 
that action is also present in that scene. The children seem 
to know that only the transitive use of the verb can be ex- 
pressing that cause. More strongly, that causal agent cannot 
be in an oblique argument position (the with phrase). 
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Prior demonstrations of knowledge of mapping rules have 
generally been with much older children. For instance, Bowerman 
notes that most spontaneous overgeneralizations of the causative 
structure ("Don't eat the baby!") are later, in the three to 
five-year old period.13 Pinker and his colleagues have offered 
many compelling demonstrations of a variety of mapping rules, 
but again mainly with three to five year olds. These findings 
give general su port to the idea that learners recruit the 
semantic/syntactic correlations somewhere during the course of 
learning. But the early appearance of these skills is crucial 
as support for the notion that the child has the mapping rules 
under control early enough for them to contribute to the 
acquisition of the verb meaninas themselves. 


Usina syntax to acquire verb meanings 


So far I've tried to show that. a number of presuppositions 
of syntactic bootstrapping are reasonable: The language does 
exhibit strong and stable syntactic/semantic correlations, and 
these powerfully predict adult classificatory behavior; children 
in the prelinguistic period can and do parse sentences to re- 
cover the analyses required for extracting subcategorization 
frame information; such phrasal information is a requirement for 
language learning, at least for aduits in the artificial lang~ 
uage-learning laboratory; children at a very young age and 
language-learning stage understand the semantic values of at 
least some syntactic frames. 


All of these findings were prolegomena to the syntactic 
bootstrapping approach. They were adduced because it is bad 
enough that this approach seems SO unnatural and formal a one 
for a child to choose; it would be worse if the child couldn't 
come up with the analyses that the nosition presupposes. But. 
now that I've presented at least some preliminary support that 
children can meet these prior requirements, the question 
remains: Do they use syntactic evidence to decide on the 
meaning of a new word? 


i 


13° But see also Naigles, Gleitman, and Gleitman (in press) 
for a demonstration that two year olds understand the sig- 
nificance of new motion transitives, even though they may not be 
brave enough to invent any until they are three. The subjects 
here were asked to "act ouc" scenes using @& Noah's Ark and its 
animal inhabitants. For instance, the child might be told to 
act out "Noah brings the elephant to the ark." But some of the 
stimuli were more unusual, e.g., “Noah comes the elephant to the 
ark" or "The elephant prings to the ark." The children by 
their acting-out performances showed that they thought transi- 
tive come means 'bring' and that intransitive bring means come. 
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The first, and justly famous, work on this topic was done 
by Roger Brown (1957). He showed three to five year olds a 
picture in which, say, spaghetti-like stuff was being poured 
into a vessel. Some subjects were asked to show some qgorp, 
others a gorp, and:still others gorping. The subjects' choices 
were, respectively, the spaghetti, the vessel, and the action. 
Evidently, the semantic core of the werd classes affects the 
conjecture about the aspect of the scene in view that is being 
labelled linguistically. 


Brown's result, though alluded to respectfully, just sat 
there for twenty years or so because in this respect as in many 
others Brown was a theorist ahead of his tine. Eventually, 
MacNamara took up and advanced these ideas: In his important 
1972 paper, he argued forcefully for the place of language 
structure in language acquisition. Experimentally, Baker, 
Katz, and MacNamara (1974) showed that children as young as 19 
months used the structure in which new nouns appeared (a 9Oorp vs 

)} to decide whether a new word encoded a class or an 
individual (i.e., a dell of the gorpy type, or some doll named 
Gorp). Thus the lex:cal category assignments of words were 
shown to carry semantic implications, and these were evidently 
recruited by learners. 


Naigles (in press), working in my lab and also in the lab 
of Hirsh-Pasek and Golinkoff at Temple University, extended this 
kind of demonstration to the case of verb learning (that is, to 
the usefulness of syntax for drawing semantic inferences within 
a single lexical category), thus giving the first direct 
demonstration of syntactic bootstrapping at work. 


Twenty-four month olds were again put into the selective 


looking situation. This time, however, their task was to 
decide between two utterly disjoin. interpretations of a new 
verb. In the training (learning) pericd, they saw a single 


screen, and the following mad event: A rabbit is pushing a duck 
down into a squatting position with his left arm (these were 
people dressed up as rabbits and ducks so they did have arms). 
fhe duck pops up, and the rabbit pushes him down again, etc. 
Simultanously, both rabbit and duck are making big circles in 
the air with their right arms. Some children heard a voice say 
and other children heard The 


rabbit and the duck are gorping as they watched this scene. 


Succeeding the observation, the screen goes dark and the 
voice is heard to say something syntactically uninformative, 
e.g. ' : 's 4 Now new videos 
appear on two screens, as shown in Figure 4. On one screen, 
the rabbit is pushing the duck down (but with no arm-wheeling) . 
On the other screen, rabbit and duck are wheeling their arms 
(but with no squatting or forcing to squat). The child's 
looking time at the screens, as a function of his syntactic 
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Ridden Speaker 


Nidden Odserver 


Child on Mother's Lap 


Figure 4: Set-up for the selective looking experiments (from 
Naigles, in press) 
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introducing circumstances, is now recorded (deuble-blind as 
usual, i.e., neither the mother nor the experimenters know 
which screen the child saw during the training period). 


Naigles' result was that virtually every 24-month old 
tested -- and there were many, this being a Ph.D. thesis-- 
showed the effect of the syntactic introducing circumstance. 
Those who heard the transitive sentence apparently concluded 
that gorp means 'force-to-squat.' Those who heard the intransi- 
tive sentence concluded that gorp means ‘wheel the arms. '!4 


What shall we conclude from this experiment? Clearly the 
child uses the event-context in some way to license conjectures 
about a verb meaning. But in this case, "The Main Event" is 
ambiguous not only in principle but in fact. Under these 
trying circumstances, at least, the learner attends to the 
eat potential of the semantically relevant syntactic 
evidence. 


A. question of scope 


So far the experiments I've mentioned have lingered ner- 
nervously around a few constructions, e.g. the lexical causative 
in English which is a notorious focus of syntactic extension by 
adults as well as children. Even if it is accepted that. 
children sometimes do use syntactic evidence to bolster their 
semantic conjectures, how broad can the scope of such a 
procedure be? Maybe its role is just to clean up a few little 
details that are hard to gleen from the world -- just backwards 
semantic bootstrapping, as Pinker has sometimes put the matter. 


The relative roles of linguistic and extralinguistic 
observation as the source of word-meaning acquisition is not 
within calling distance of settlement, of course. But the 


14 Notice that in all the selective looking experiments 
I've mentioned all the participants are animate so there's no 
room for counter-interpretations such as the strategy of 
assigning the animate entity to the subject position. Note also 
that in the present experiment the intransitive sentence 
contained a conjoined nominal (The duck and the rabbit) and this 
might be seen as a defect: Maybe the child knows the difference 
beteen a preverbal and a postverbal nominal rather than the 
difference between a transitive and an intransitive structure. 
This interpretation is effectively excluded by the version 
presented earlier (Hirsh-Pasek et al, 1988b) in which the two 
noun-phrases appear in different argument positions, one 
serially before and one after the verb ( 

Cookie Monster). For elegance, however, it certainly would be 
nice to redo the present experiment with the stimulus type used 
in the former one. 
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burgeoning linguistic and psycholinguistic literature on lexical 
semantics suggests that the semantic/syntactic linkages may be 
quite pervasive and stable, and play a potent role in organizing 
the verb lexicon. 


Fisher, Hall, Rakowitz, and I have just completed some 
studies designed to investigate the scope of children's ex- 
ploitation of the syntactic environment in learning new verb 
meanings. I believe that our prior studies with children two 
years old and younger yield evidence that satisfies an explan- 
atory demand of this approach: The bootstrapping procedure has 
to be able to operate very early in the child's linguistic 
life, else it hardly explains how verbs are acquired. 


Nevertheless, the selective looking paradign (which is one 
of very few that work with toddlers) is too much of a straight- 
jacket to be the only vehicle for extensive investigation of 
this approach. It is tedious in the extreme to set up (requir- 
ing the preparation of movies, etc.), takes hoards of infants 
to carry out (for some scream oF sleep or worse and have to be 
removed from the premises; and only a few trials can be pre- 
sented even to the more docile infants), and yields probabilis- 
tic results. (in part because the subjects are not notified 
directly of the task they are to perform). Moreover, it may 
very well be that the child's knowledge of the linking rules 
expands as his language knowledge grows, creating more latitude 
within which he can learn new meanings from linguistic evidence 
(After all, in the end we can do it by looking in the diction- 


ary). 


We therefore set out to see whether preschoolers (aged 3 
and 4 in the version now presented) would give us meanings in 
response to linguistic/situational stimuli upon request. The 
idea derived from a manipulation attempted by Marantz (1982). 
He had asked wnether children are as quick to learn noncanonical 
vs noncanonical mappings of semantics onto syntax. He intro- 
duced children to novel verbs as they watched a movie. For 
instance, one movie showed a man pounding on a book with his 
elbow. Marantz! question was whether children were as quick to 
learn that The book is moaking Larry (the noncanonical mapping) 
was a way of describing this scene as that a] 
book (the canonical mapping) was a way of describing the scene. 


Although the manipulation was an interesting one, unfor- 
tunately Marantz never asked the children how they interpreted 
the scene, so his results are not really relevant to understand- 
ing the child's perception of syntactic/semantic correlations. 


That is, Maranz that a scene viewed has only a 

single interpretation, an j@ea I have strenuously opposed 

throughout this discussion. My colleagues and I revised this 

experiment, changing the measure so we could find out about the 

child's comprehension in these circumstances. In essence, we 
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asked how the nonsense word is interpreted within differing 
linguistic environments. As a first step, we showed the 
moaking scene (in which Larry pounds the ball with his elbow) to 
adults. If we said "This scene can be described as a “moaking 
scene" and then asked them what moak meant, they said "“pound- 
ing." And if instead we showed them the scene and said "This is 
Larry moaking the book," they still asserted that moak means 
"pound." But when we showed them the scene and said "This is 
the book moaking Larry," they answered that moak means “hurt" or 
tresist.™ 


This suggests that adults make use of the fact that 
particular surface syntactic structures are associated with 
particular semantic values. They seem to bootstrap the 
meaning from examination of the scene taken together with its 
syntactic expression, just as the syntactic bootstrapping 
procedure claims. To be sure, the contextless presentation of 
moak with this scene irresistably yields the concept ‘pound’ as 
its interpretation. So there's much to be said for the idea of 
"salience" in the interpretation of events (though, to be sure, 
no one knows what exactly). But the important point is that 
there is a categorical shift in interpretation of the same scene 
-- to a less salient, but still possible, interpretation -- in 
response to its linguistic setting; namely ‘pound' if Larry is 
in the subject position, but ‘hurt' if the book is in that 
position. 


Fisher et al now adapted this procedure for children. We 
took advantage of the idea, popularized by such Penn developmen- 
talists as Gelman, Waxman, Macario, and Massey, that preschool- 
ers will do just about anything to help out a puppet. We intro- 
duced a puppet saying "This puppet sometimes talks puppet-talk 
so I can't understand him: can you help figure out what he 
means?" The children were happy to oblige. They were shown 
videotaped scenes in which animals were performing certain acts. 
For example, a rabbit appeared, looked to the left, and then ran 
rapidly off the screen toward the right; directly behind him ran 
a skunk, also disappearing at the right. Then the child would 
hear either "The rabbit is gorping the skunk" or else "The skunk 
is gorping the rabbit." 


The structures investigated are shown ir. Table 4e They are 
designed to ask whether the child is sensitive to the number of 
argument positions (stimuli 1 and 2), the structural positions 
of agent and patient (stimuli 3 and 4), and the structural 
positions taken together with prepositional markers of the 
oblique roles (stimuli 5 and 6). Thus we now began to inves- 
tigate the scope of the structural/semantic linkages to which 
learners may be sensitive. Notice that the pairs chosen are 
just the kind that I have discussed throughout: The same 
scenes, multiply interpretable, are shown but accompanied by a 
novel verb used in varying constructions. 


SCENE STIMULUS SENTENCE 
1. a) Rabbit eating. The rabbit moaks. 
b) Elephant feeding rabbit The elephant moaks the rabbit. 
2. a) Monkey pushing elephant. The monkey pumes the elephant. 
b) Elephant falling The elephant pumes. 
3. a) Monkey riding elephant. The monkey gorms the elephant. 


b) Elephant carrying monkey. The elephant gorms the monkey. 


4. a) Rabbit fleeing skunk. The rabbit zarps the skunk. 
b) Skunk chasing rabbit. The skunk zarps the rabbit. 
5. a) Rabbit giving a ball to The rabbit ziffs a ball to 
elephant. the elephant. 
b) Elephant taking a ball The elephant ziffs a ball 
from rabbit. from the rabbit. 
6. a) Skunk putting blanket on The skunk is biffing a 
j monkey. blanket on the monkey. 
b) Skunk covering monkey The skunk is biffing the 
with a blanket the monkey with a blanket. 


Table 4: Stimuli used by Fisher, Hall, Rakowitz, and Gleitman 
(forthcoming). All Ss were exposed to the same six scenes (each 
scene has two plausible interpretations, called a) and b) in the 
left-hand column. Along with these scenes, half the children 
heard a) stimulus sentences and half heard b) stimulus sentences 
(with appropriate counterbalancing across Ss and stimuli). 
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The findings are shown in Table 5. They are presented in 
terms of the likelihood of various responses depending on the 
introducing syntactic structure. For example, the response give 
(response A) to structure (a) in Table 4 (The rabbit ziffs a 

)} was made by 4 Ss, but the response take 
(or, equivalently, get) was made by only 2 Ss in this condition. 
Symmetrically, the response take or get (response B) was made by 
5 Ss in response to structure (b) in Table 4 (The elephant ziffs 
a_ball from the rabbit), while that response was never made to 


structure (a). 


Overall, 71 relevant responses made by these children were 
congruent with the semantic value implied by the syntactic 
structure, while only 13 relevant responses were inconsistent 
with the structural information. Moreover, for each scene and 

, the number of syntactically congruent 
responses is greater than the noncongruent responses. The level 
of congruence was about the same for all three semantic/syntac- 
tic relations studied: 83% congruent responses when the 
variable was number of noun-phrases, 89% congruent responses 
when the variable was structural position of these noun-phrases, 
and 81% congruent responses when the variable was position plus 
prepositional marking. 


One might object that these children are "merely" paraphra- 
sing verbs that they previously know to occur in these syntactic 
environments. That is true, but it does not take away serious- 
ly from our interpretation of these findings: The children 
knew, evidently, that the appropriate paraphrase had to be one 
which fit both with the scene and with the sentence structure 
heard. This is the reverse of Pinker's Claim that the verb 
Meanings must be acquired by extralinguistic observation in 
advance of, and as the basis for, deducing their appropriate 
syntactic structures. But the results are exactly those 
expected in the syntactic bootstrapping approach. 


A note on the input corpus 


One of several holes in our present evidence has to do with 
the characteristics of caretaker speech. I have presented a 
single example corpus (Table 2) tending to support the idea that 
caretaker speech is rich enoucn to yield quite a full range of 
structures to support the syntactic bootstrapping mrocedure. 
And this corpus was for a mother speaking to a blind child, 
whose word-learning situation may be quite special. We are now 
analyzing an extensive corpus of mother/child speech in a natu- 
ralistic setting (originally collected by Landau and Gleitman) 
to see whether children characteristically receive the range of 
structures adequate to support a realistic syntax-based proce- 
dure (Lederer, Gleitman, and Gleitman, 1989). So far, the 
prospects from this larger database look good. Lederer finds 
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Syntactic type 


of the stimulus Response A Response B 
a b Sa Sp Sb Sa 
eat feed 7 pA 6 1 
push fall 8 3 4 a) 
ride carry 7 2 4 0 
flee chase 6 0 8 1 
give take 4 2 5 0 
put cover 8 3 4 0 

TOTALS: 40 11 31 2 


Table 5: 16 Ss (aged 3-4) asked: WHAT DOES BIFFING MEAN? 
Not all subjects answered every question, accounting for totals 
in each row not totalling to 16. Also, some responses were 
irrelevant to either interpretation of a stimulus, e.g., S might 
say in response to the flee/chase scene "They're having fun!" 
These irrelevant stimuli are excluded from this tabulation. 
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that each of the 24 verbs most often used by these mothers to 
their children has a distinctive syntactic distribution. When 
the usages are pooled across mothers, these distinctions are 
preserved. 


The next question is whether these syntactic distributions 
map onto a semantic space coherently. An independent assess- 
ment of the semantic relations among these verbs is required as 
the evidence. Lederer therefore is now testing this issue by 
using these verbs in the kind of manipulation employed by Fisher 
et al: namely, asking adult subjects for judgments of the 
semantic outlier in all triads of these verbs. Preliminary 
inspection of the verbs suggests that the semantic clusters that 
emerge from these data are strongly predicted by the syntactic 
overlaps in the maternal corpora. 


Conclusions 


I began discussion by acknowledging the intuitive power of 
Locke's view that words are learned by noticing the real-world 
contingencies for their use. Then I tried to show that such a 
word-to-world mapping, unaided, was in principle insufficiently 
constrained to answer to the question of how the child matches 
the verbs (qua phonoldgical objects) with their meanings. The 
solution that I and my colleagues have offered was that 
semantically relevant information in the syntactic structures 
could rescue observational learning from the sundry experiential 
pitfalls that threaten it. This theory, of course, is the very 
opposite of intuitive. But when probable solutions fail, less 
probable ones deserve to be considered. I therefore sketched a 
rather wide-ranging empirical review that we have undertaken to 
sea whether, after all, children might not be deducing some of 
the meanings from their knowledge of structural/semantic 
relations. I believe that the evidence we now have in hand 
materially strengthens the plausibility of the viewpoint. 


Still, the conclusions that can be drawn currently about 
the generality and pervasiveness of syntactic bootstrapping must 
be exceedingly tentative, on a variety of grounds. Some of 
these I have discussed: No one has more than a glimmer of an 
idea about just how the verb lexicon is organized, and therefore 
we don't really know how much information about semantics can be 
gleaned from that organization. Also, we have at present only 
the most meager data concerning the orderliness and richness of 
the child's syntactic input. Facts about the cross-linguistic 
similarities in the syntax/semantics correspondances are also 
extremely fragmentary, currently. 


There are in addition numerous problems with the analyses 
performed that I have altogether skirted so far. For example, 
it is not an easy task to decide which structures co-occurring 
with verbs should actually be considered part of the frame 
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specifications, and which are merely adjuncts. To construct 
Table 2 (and in Lederer's ongoing work) we had to make some 
choices, but some of them may be wrong. And if we had these 
problems in assigning structural descriptions. to the mother's 
utterances, isn't the learner similarly beset?25 Another huge 
problem is the "idiomatic" verb uses that I mentioned in passing 
(footnote 10), e.g.-, 
his enenies in the eye, etc. It may be significant that these 
monstrosities are just about totally absent from the maternal 
corpora we have examined, but absence in fact (rather than in 
principle) is a pretty weak reed on which to build so strong a 
position as the one I've tried te defend. 


The largest problem of ali is how learners acquire the 
semantic/syntactic linking rules in the first place. Bower- 
man's evidence, and all the findings I've just discussed, are 
understandable only (so far as I can see) by asserting that 
learners are in possession of such linking rules. But where dic 
they come from? In the present discussion, I've subscribed to a 
version of Jackendoff's hypothesis that the linking rules are 
somehow cognitively transparent to the child. But since there 
js at least some cross-linguistic variance in such syntac- 
tic/semantic regularities (see Talmy, 1985), I admit that I'd be 
happier to find that they could be derived from some more 
primitive categories or functions. The problems here cry out 
for serious investigation. 


In light of the various issues just mentioned, one must 
remain agnostic about the bootstrapping proposals, at present. 
But I hope I've persuaded you that the prospects they open for 
explanation of the verb-learning feat are enticing enough to 
make continued investigation seem worthwhile. 


It remains to point out that, by their nature, both 
semantic and syntactic bootstrapping are perilous and errorful 
procedures and their explanatory power must be evaluated with 
this additional proviso in mind. Bowerman's children, drawing 
syntactic conclusions from meaningful overlap, are sometimes 
wrong. Errors are made insofar as the scenes are multiply 
interpretable; for instance, youngsters often interchange win 


15 here is some evidence in the literature of adult 
speech perception that adjunct and argument phrases may be 
intonationally distinguishable (see Gleitman and Wanner, 1982, 
for a review: and Carlson and Tannenhaus, 1988, for some 
experimental evidence). These distinctions, if real, can be 
expected to be exaggerated in maternal speech. Nevertheless, 
the issues here are quite complex and have not been thoroughly 
studied by any means. And they do bear in serious ways on the 
rena of work that syntax can be expected to do for the verb 
earner. — 
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and beat, presumably because these occur in exactly the sane 
circumstances. But syntactic bootstrapping is no more free of 
potential error. This is because the form-to-meaning mapping 
in the exposure language is complex and often inexact. For 
instance, exit, enter, reach and touch differ from most verbs 
describing directed motion through space in not requiring 
prepositional phrases to express the wotion paths (compare come 
into the reom but enter the room). One outcome of this inexact 
mapping of form onte meaning is errorful learning (e.g., the 
child may say "I touched on your arm") and its end point, 
language change (e.g., while exit the stage was the more common 
in Shakespeare's time, exit from the stage is now on the 
ascendancy). Short of changing the language, how do learners 
recover from such errors? 


The position I have been urging is that children ferret out 
the forms and the meanings of the language just because they 
can play off these two imperfect and insufficient databases 
(the saliently interpreted events, and the syntacticaliy 


interpreted utterances) against each other to derive the best 


fit between them. Neither syntactic nor semantic bootstrapping 
work all the time, nor taken together do they answer to all the 
questions about how children acquire their verb vocabulary. 
But I hope I've convinced you that each of these procedures 
works very well indeed when it does work, so the wise child 
should, and probably does, make use of both of then. 


-=ee Qe-- 


* This paper is the text of the keynote address delivered to the 
Stanford Child Languge Conference in April of 1989. The ideas 
contained in it were developed in collaboration with a number of 
colleagues and students, whose contributions are cited through- 
out the text. I am particularly indebted to two individuals 
who helped me throughout the preparation of this address. The 
first is my husband, Henry Gleitman, who -- as always -~ quietly 
contributed a large share of the ideas and most of whatever 
organization and coherence this draft contains. Anne Lederer 
has also been a crucial aid in offering significant ideas and 
helping me get my head together on some of what's said here. I 
should add that, beyond their intellectual labors on ny behaif, 
these colleagues were repeatedly willing to cut and paste, and 
even run and fetch, to help me meet deadlines. For both kinds 
of contribution, I am very grateful. I want also to express 
appreciation for a University of Pennsylvania Biomedical 
Research Grant (sponsored by the National Institute of Health 
under Grant # 2-S07-RR-07803-23) which underwrote the more 
recent experimental work that I report here. 
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ACQUISITION OF NOUN INCORPORATION IN INUKTITUT* 


Shanley Allen & Martha Crago 
McGill University 


1. Introduction 

This paper investigates the first language acquisition of productive noun incorporation 
in Inuktitut. It begins with descriptions of noun incorporation, relevant aspects of the 
structure of Inuktitut, and working criteria of productivity in sections 2, 3 and 4. It then 
presents acquisition data from Inuktitut in section 5 and corroborating data from West 
Greenlandic in section 6, and contrasts both of these with acquisition data from Mohawk in 
section 7. Finally, several explanations for the seemingly early acquisition of noun 
incorporation in Inuktitut are hypothesized in section 8. 


2. Noun Incorporation 

Noun incorporation (henceforth, NI) is a structure which appears in a large variety of 
genetically and typologically diverse languages. In NI, a particular noun root from the 
sentence appears inside the verb form rather than as an independent lexical item. The two 
roots appear to work together as a unit for purposes of agreement marking, case 
assignment, and other relevant processes. It is standardly assumed in a variety of 
frameworks that both Inuktitut and Mohawk evidence noun incorporation (Baker 1988; 
Mithun 1984; Rischel 1971; Sadock 1980, 1986). 


(1) a. Palasi-p miqi-@ niri-vaa. b, Palasi-@ niqi-tur-puq. 

minister-ERGsg meat-ABSsg minister-ABSsg meat-eat-3sS.INDIC 

eat-3sS/3sO.INDIC ‘The minister eats/ate meat.’ 

‘The minister eats/ate the meat.’ (Greenlandic; Rischel (1971)) 
(2) a. Wa?kyvtho? oji:ja?. b. Wa?kji?jayvtho?. 

wa?-k-yvtho? 0-ji:ja-? wa?-k-ji?ja-yvtho? 

AOR-1!sS-plant PRE-flower-SUF AOR-1sS-flower-plant 

‘Il planted a flower.’ 'l planted a flower.’ 

(Mohawk; Bonvillain (1974)) 


In the (a) examples, the structural object noun roots appear as independent lexical items 
with their own case marking. In the (b) examples, however, the noun roots appear inside 
the verbal complex, case and other inflections having been dropped. Aso to be noted in 
Inuktitut is that the verb is inflected for both subject and object in the unincorporated form, 
but only for subject in the incorporated form. 


* We wish to thank Betsy Annahatak and Lizzie Ninguiruvik for useful discussion of various elements of 
this paper, and Mark Baker, Michael Fortescue, Marianne Mithun, and Lydia White for helpful comments 
and discussion on earlier drafts. Thanks also to Betsy for invaluable help in preparing the data for linguistic 
analysis, and to Johnny Nowra for seemingly endless data transcription. The research upon which this 
paper is based was funded in part by a Canadian Northern Studies Trust Studentship, an FCAR Fellowship, 
and a research grant from the Kativik Schoo! Board to the first author; and a research grant from the Toronto 
Sick Children's Hospital Foundation to the second author. 
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3. Grammatical Outline of Inuktitut 

Inuktitut (IKT) is a language of the Esl imo-Aleut family, and encompass s several 
mutually intelligible dialects across Norther. Canada. Typologically, it is noted for its 
highly polysynthetic nature and ap cea ay complexity. Words typically consist 
of a noun, verb, or adverbial stem followed by from 0 to 8 or more lexical and grammatical 
morphemes, then an obligatory inflectional suffix, and finally optional enclitics. 

Nominals are obligatorily marked for Case and number, and for person and number 
of possessor if applicable. Adjectival and other modifiers of the nominal which constitute 
separate words (i.¢. not bound morphemes) are treated as nominals in Inuktitut and take the 
same person and number inflections as those on the nominal which they modify. Verbs 
inflect for both subject and objects in absolutive Case, but for neither objects in secondary 
Case nor incorporated objects. Word order is generally assumed to be basic SOV, though 
because Inuktitut is a pro-drop language it is relatively rare to encounter a sentence 
containing all of subject, object, verb, and other modifiers. Within a noun phrase, word 
order is much more rigid: possessors precede the head noun, and modifiers follow it. 


4, Productivity 

One of the great difficulties in any study of acquisition is determining the point at 
which a child begins using a morpheme or structure productively: to at least subconsciously 
recognize a certain morpheme as having a particular function of its own in the word- 
building processes of a language. We will adopt the criteria for productivity in Inuktitut, 
following Fortescue & Lennert Olsen (to appear). The first criterion is obviously the most 
clear and strong, with the second and third following in that order. 


(3) CRITERIA OF PRODUCTIVITY 
1. The morpheme in question is wrongly attached to its stem in terms of correct rules 
of phonology or morphology. 
2. The morpheme in question appears in the transcript on at least two different stems, 
and preferably with two stems of phonologically different types so that two 
allomorphs of the morpheme are required. 
3. Alternatively, the stem appears with a different morpheme attached in the same 
place, elsewhere in the transcript. 


In terms of NI, it is most useful to refer to productivity of the verbs which allow 
incorporation since they are a much more restricted class than the nouns which may 
incorporate. An incorporating verb (henceforth, IV) will be termed productive, then, if it 
or the incorporated noun evidence attachment errors (criterion 1), if it appears in the 
transcript with at least two different nouns incorporated into it (criterion 2), or if the noun 
which incorporates into a particular verb appears elsewhere in the transcript either 
independently with nominal inflection or incorporated into another verb (criterion 3). 


5. Inuktitut NI 

This section investigates production data from one child speaker of Inuktitut, and 
;}ustrates that NI in Inuktitut is beginning to be acquired productively by at least 2;5. The 
data cited here are taken from 10 hours of videotaped naturalistic communication between 
an Inuk boy, Jaaji, and various members of his extended family, in Kangirsuk, Nouveau 
Québec. Tapings were done at 4-month intervals beginning at age 1;9. The sol. iaaguage 
of interaction among family members was Inuktitut. Since no instances of IN} were 
observed at 1:9, no data from that age will be considered. 
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§.2.1 Jaaji at 231 
Jaaji's NI structures at 2;1 are not overwhelming, but they do exist. However, only 


one of the IVs fits the criteria of productivity and even this is questionable on the basis of 
native speaker intuition. 


(4) a. Tiituq. b. Sikituurtualuit. .. 
tii-tuq sikituug-tuq-aluk-it 
tea-consume skidoo-ride-EMPH-3pS 
‘(1 want) to hav2 some tea.’ ‘They're riding skidoos... .' 


The IV rq, ‘to use for its intended use’, appears with several different incorporated nouns 
(henceforth, IN) of two phonological types which appears to be clear evidence for its 
productive use. However, each of these phrases is quite common in everyday speech, 
particularly that of young children, so it is conceivable that each is treated as an independent 
lexicalized unit. This hypothesis is strengthened by a mistake of omission shown in (5): 


(S$) * Umialauluuk? 
umiaq-lauq-luk 
boat-POL- 1dS.IMPER 
‘Let's go for a boat ride?’ 


In adult speech the morpheme tuq must immediately follow the noun wniag. Thus it 
seems that Jaaji may not have completely grasped the use of tug, or may only be using it 
lexically, since he is not using it in all obligatory instances. 

Two other IVs are productive under criterion 3: lag and liag in (6) and (7): 


(6) a. Kamilasiviit? b. Amiikka gaani. 
kamik-laq-si-vit kamik-Vkka gang-ani 
shoe-take.off-PRES-2sS. INTER shoe-1SduABS on.top-LOC 
‘Are you taking your shoes off?’ 'My shoes are on the top.’ 

(7) a. Qangattajuuliaq. b. Qangattujuu! 
qangattajuugq-liag gangatiajauq 
airplane-go.to e 
"We're going to meet the plane.’ ‘Airplane!" 


In the (a) examples, the nouns in question appear incorporated into verbs, while in the (b) 
examples they appear as independent elements with appropriate nominal inflection. 


5.2.2 Jaaji at 2;5 

By 2;5 Jaaji has acquired three productive IVs and a fourth, tq, is still inconclusive. 
First, liag now meets the first criterion of productivity. It appears correctly with two 
different incorporating nouns, one shown in (8), and is also a clear victim of 
overgeneralization as shown in (9): 


(8) Kuapalialangvuruu. (9) * Avunnguliaratta! 
kuapak-liaq-langa-vuguk av-unnga-liag-gatta 
coop-go.to-FUT- 1dS.INDIC over.there-motion.to-go.to- 1pS.PERF 
"We'll go to the co-op later.’ "We're heading there!' 
og 
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In (9), diag appears with an adverbial of direction incorporated into it. While adverbs of 
place often incorporate in Inuktitut, this one is already marked for directional movement by 
the affix -unnga and so its incorporation into liag is redundant and considered incorrect in 
adult speech. ! 

Two other IVs,u and aq are also productive at this age. Both appear wit: various 
INs, though neither varies phonologically in a fashion relevant to productivity. 


(10) a. Iqaluguluuvit? b. Marquuluta. 
iqaluk-ruluk-u-vit marqur-u-lutaé 
fish-pitiful-be-2sS. INTER two-be-1pS.IMAPP 
‘Are you a pitiful fish?’ ‘Let's be two of us.’ 

(11) a. Ataataqanngitutua? b. Umiajuaqarqugu. 
ataata-qaq-nngit-jug-tuaqg umiajuaq-qaq-vugut 
father-have-NEG-3sS.PART-only ship-have-1pS.INDIC 


‘He's the only one without a father?’ "We (too) have a ship.’ 


§.2.3 Jaaji at 2;9 

Jaaji has slightly expanded his repertoire of IVs at 2;9: one by criterion 2, three by 
criterion 3, and three inconclusive. The most productive is the copula u ‘be’, which 
appears with various INs and in two allomorphs. Three additional IVs, taaq, tuq, and si 
are termed productive by criterion 3. Consider the data in (12) and (13): 


(12) Taatialu paisikuttugulu. (13) Imaitruturumaviit? 
Taati-aluk paisikug-tuq-ruluk imaittug-tuq-guma-vit 
Taati-big bicycle-ride-pitiful this.kind-consume-want-2sS.INTER 
'Big Taati is pitifully riding a bicycle.’ ‘Do you want some of this kind (of food)?’ 


Here mq appesrs with two different nouns incorporated into it, demonstrating that it iS 
likely productive, and the following two examples provide corroboration by illustrating 
each of the INs used with a different IV. In (14) paisikugq is incorporated into taaq, 
parallel with (12), and in (15) imaistuq is incorporated into si, parallel to (13). This 
comparison also indicates the productivity of the two comparison IVs 


(14) Paisikuttaatu. 
paisikuq-taaq-juq 
bicycle-acquire-3sS.PART 
‘He got himself a bicycle.’ 


(15) Una kuukuumik imaittusilaarqanga. 
una kuukuu-mik imaittuq-si-laaq-vanga 
DEMsg kuukuu-INSTRsg this.kind-buy-FUT-2sS/1!sO.INDIC 
‘Buy me some of that kuukuu, some day.’ 


1 Note that this observation holds only dislects of Inuktitut spoken on the Ungava coast. On the Hudson 
coast the sentence in (25) would be considered correct in aduli speech. The child in question here does not 
have any regular contact with a speaker of that dialect. 

2 This verbal inflection is incorrect; it should be Lunuk ‘1dS.IMPER’. However, this mistake does not 
influence the consideration of the productivity of NI. 
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5.3 Stranding . | 
A more advanced step in acquisition of NI is the product'on of stranding structures. 


In stranding, lexical items such as adjectivals, numeral phrases and possessors which 
modify the noun and are included in the NP in unincorporated structures still exist and 
carry the same semantic relationships in incorporated structures, even though the noun 
which they modify has been incorporated into the verb complex and the modifier maintains 
its position outside the verb complex. . 

Production of stranding structures requires either the cognitive or structural ability to 
deal with the discontinuous dependency between the IN and its corresponding modifier, as 
well as the basic NI structure, and thus they constitute a more advanced step in the 
acquisition of NI. The child in this study did not produce any examples of stranding, 
which is not really surprising since it is undoubtedly more complex than NI itself and he 
was still in the beginning phases of dealing with NI. However we did encounter examples 
of stranding in observation of slightly older children in a nearby community. For instance, 
at about 3;0 the child was saying such sentences as in (16) with stranded numerals. 


(16) Marruunik aukulutturumajunga. 
marruuq-nik aukulut-tug-ruma-junga 
two-SECp! _chocolate.bar-eat-want-isS.PART 
‘I want to eat two chocolate bars.’ 


This concludes our look at NI acquisition data from Inuktitut. We will now look at some 
related data from other polysynthetic languages. 


6. Greenlandic NI 

Acquisition data from West Greenlandic (Fortescue & Lennert Olsen (to appear)), 
another dialect in the Eskimo-Aleut family, corroborates our findings from Inuktitut 
concerning NI. In addition, this data shows that basic stranding structures are certainly 
acquired by age 4;7. Examples from 4;7 and 5;2 are shown in (17) and (18) respectively: 


(17) Anaana ilaa uanga napparsimallunga pingasunik pinikuuvunga. 
anaana ilaa uauga nappar-sima-liunga pingasut-nik pinik-u-vunga 
mummy right I sick-PAST-isS.IMAPP three-SECpi things-be-1sS.INDIC 
'l once got three when I was sick, didn’t 1, Mummy?’ 


(18) Taava gimmit toqugunik allanik po ag a ned 
taava qimmeq-it toqu-gunik alla-nik _inissaq-qaq-nngit-ramikkit 
so dog-ABSpl die-4pS.IMPERF other-SECp! place-have-NEG-4pS/3pO.PERF 
'So when dogs die, since they don't have any other place for them... . 
(Fortescue & Lennert Olsen (to appear)) 


In (17) the numeral ‘three’ refers to the quantity of things which the child got, and thus the 
two items ‘three’ and ‘things’ must be construed in a stranding structure. In (18), the 
modifier allanik ‘other’ is stranded from the NP, inissag ‘place’, which it modifies. 


7. Mohawk NI 

Acquisition data from Mohawk, an lroquoian language, show that NI in Mohawk is 
not acquired productively until after age 6. Mithun (to appear) presents acquisition data 
based on cross-sectional study of § children learning Mohawk as a first .anguage. The 
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children, aged 1;9 to 4:9, were each observed and recorded for at least half a day, in casual 
circumstances at either home or school. Examples of NI first appear in the fourth child, 
aged 2;10, as shown in (19), and then in the fifth child aged 4;9, as shown in (20): 


(19) ronkwe’dksen 
r-onkwe't-aks-en 
MASCsgPAT-person-bad-STATIVE 


‘he is a bad man (Mithun (to appear: 27)) 
(20) a. kanahskwaksen b. iohné:tes 
ka-nagskw-aks-en io-hnot-es 
NEUTsgAGT-animal-bad-STAT | NEUTsgPAT-water.level-deep.STAT 
‘it is a bad animal’ ‘it is deep’ (Mithun (to appear: 39)) 


However, Mithun (to appear: 39) states regarding all instances of NI in her data that "there 
is no reason to suspect that [they] created any of the forms [themselves]. All of the 
combinations [they] used are heard frequently, and in many cases the constituent roots do 
not occur alone, so the forms were most likely learned as lexical units”. 

This concludes our overview of relevant data. We now tum to possible explanations 
of the seemingly early acquisition of NI in Inuktitut with some reference to the contrast 
with Mohawk. 


8. Possible Explanations of Differences 

Presumably there are some factors in effect, whether structural or sociolinguistic, 
which make it more difficult for Mohawk children than for Inuit children to produce NI 
structures. Several possibilities are discussed below. 


8.1 Verbal Affixation in Relation to N Root 

One interesting structural difference to note is the placement of verbal affixation in 
relation to the incorporated noun. Agreement, tense, reflexive and other affixes precede 
the V in the Mohawk verb complex, while all these affixes and more follow the V in the 
Inuktitut verb complex. This is relevant for two reasons. 

First is adjacency between the V and its affixes. Slobin (1985) observes in a cross- 
linguistic comparison of Japanese, Turkish, Polish and Hungarian that children evidence 
"preferences to keep grammatical markers of aspect, tense, and person close to the verb, 
while keeping negation and conditionality peripheral (Slobin 1985: 12).” This he attributes 
to the fact that tense and person are more inherently part of the meaning of the verb itself, 
while negation and conditionality have scope over the meaning of an entire clause. It is 
possible, then, that children might initially resist placing an IN in a position which 
increases the distance between a verb and its tense and person affixes. Since in Mohawk 
the IN must intervene in just such a position, most NI structures can be represented in an 
unincorporated form, and the process of NI tends to indicate a pragmatic effect 
encompassing the entire clause or sentence, children would presumably rather tend to leave 
the N unincorporated until later in the acquisition process. In Inuktitut, however, the IN 
does not block the adjacency of any affixes of person, tense, etc. since they all appear on 
the other side of the verb and therefore there is no reason why this factor of hierarchy of 
relevance should affect the acquisition of Ni in Inukutut. 

Second, it has been shown that that morphemes at word boundaries are more salient 
to children than those inside the word. In Mohawk the IN is well-entrenched inside the 
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verbal complex with various affixes on either side, In Inuktitut, on the other hand, the IN 
is always at the very beginning of the verbal complex. Thus it would not be surprising for 
the acquisition of NI to be influenced by this difference in salience of INs. 


8.2 Criteria for Use of NI - Optional/Obligatory 

A second possible explanation is that the criteria for use of NI are more restrictive or 
more clear in Inuktitut. NI in Inuktitut may be termed “obligatory” or "lexically governed” 
in that the verb into which the noun incorporates is only allowed to appear with an IN. NI 
in Mohawk, on the other hand is mostly “optional” or "stylistically governed” in that the 
verb which permits incorporation of nouns can also appear as an independent lexical item 
without an IN. 

One possible ramification of this derives from Slobin (1985) who states that children 
have a preference for analytic over synthetic expressions. It is interesting to note here that 
those examples of NI which do appear in the Mohawk acquisition data are all examples of 
“obligatory” incorporation: both the adjectival V roots and the noun which is incorporated 
into it may only appear in incorporating structures. Thus the earliest NI expressions to 
emerge in Mohawk are those which have no analytic counterpart, and analytic forms are 
otherwise used in child speech until at least age 6. It is slightly problematic, however, that 
even when more or less equivalent analytic counterparts exist in Inuktitut they are acquired 
later than the synthetic NI structures. 

A second possibility is that things which are lexically-governed are very clear in terms 
of which structure must be used. However, things which are stylistically-governed are 
quite a bit less clear and require more subtle interpretation. Therefore the child might find it 
easier in Inuktitut than in Mohawk to figure out when NI is to be used. 


8.3 Degree of "Usualness" of NI in Adult Speech 

A third possible reason for the early acquisition of NI in Inuktitut is the degree of 
"usualness” of NI in adult speech. When two or more structures are available to express 
basically the same meaning, and there is a feeling among native speakers as to which of the 
forms is the most usual, we intuitively expect the most usual form to be learned first, all 
other things being equal. 

Mithun (1984) presents the thesis that in most cases of noun incorporation the 
unincorporated form is the norm and NI takes place for a specific purpose. In this situation 
a child would be expected to acquire the unincorporated form first, then alter it as necessary 
according to the pragmatics of the situation at hand. Since Mohawk follows this pattem, it 
is not surprising to observe that NI is acquired quite late. 

Sadock (1986:25), however, notes that in many cases Greenlandic "provides no non- 
incorporated form of equal or less complexity and idiomaticity than the incorporated form." 
Thus it may well follow the pattern that in languages where NI is the normal and usual 
form"... it is not the case that ‘speakers . . . incorporate for a purpose [Mithun (1984)]’, 
but rather that they REFRAIN from incorporating for a purpose (Sadock (1986:21))”. 

Ina language like Inuktitut where NI is considered the "most usual" way to represent 
the concept at hand, a child would most likely learn the incorporated form first and produce 
the unincorporated form only at a later date. In fact, unincorporated forms in Inuktitut only 
Start appearing around age 4. 


8.4 Degree & intensity of Child Exposure to Language 


The final possibility we will put forth is a more sociological one having to do with the 
degree and intensity of the child's exposure to the language being learned. If exposure is 
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conversational partners in that language, acquisition is likely to progress more slowly than 
in an environment where the language is being used on a daily basis by almost all speakers. 


age learning. Mohawk is a language suffering fairly rapid attrition. It is spoken 
sdcits of grandparent eae few children are currently acquiring it as a 


we studied is Inuktitut. On the basis of this information it would not be unreasonable to 
suspect a differential level of exposure to the respective native language in the two 
societics, leading to differential acquisition in favor of Inuktitut. In fact, it almost seems 
that the Mohawk situation is an L2 rather than L1 learning situation. While it is unlikely 
that the acquisition of a structure per se would be radically affected by such a factor, the 
grasp of a structure used predominantly for semantic purposes might be since less exposure 
to the language may well decrease the speed with which the child picks up > mantic 
nuances. This would be especially relevant to NI in Mohawk since NI is used in that 
language for primarily semantic purposes (Mithun (1984)). It is certainly possible, 
however, that under more empirical testing no effect is evidenced. 
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Why do children omit subjects?! 
Paul Bloom 
Massachusetts Institute of Technology 


It is well known that children acquiring English frequently produce utterances with 
missing constituents. This paper is concerned with why children produce sentences without 
subjects, like those shown below (from Bowerman, 1973). 


(1) hug Mommy 
play bed 
writing book 
see running 


One theory of these utterances is that young children represent different grammars 
than adults. In particular, Hyams and her colleagues have proposed that all children start 
off with a pro-drop grammar, one where overt subjects are optional (¢.g., Hyams, 1986). This 
is the correct grammar for a pro-drop language like Italian but incorrect for a non pro-drop 
language like English, where overt subjects are obligatory. So children acquiring English 
need some sort of evidence in order to change their grammar from pro-drop to non-pro-drop. 
There are several different proposals of exactly what sort of input causes the parametric 
switch (e.g., Borer and Wexler, 1988; Hyams, 1986, 1987; Pierce, 1987). 

In this paper, evidence is presented tor an alternative explanation, which is that young 
children represent the correct grammars from the very start but omit subjects because of 
performance factors. This performance explanation of subjectless sentences motivates a 
considerable shift in how we look at the acquisition of pro-drop and non-pro-drop languages. 


Comparing the processing theory w'th the pro-drop theory 

Before discussing the empirical evidence, it is worth considering one strong motivation 
for assuming that such a processing limitation exists. It is often argued, on both empirical 
and theoretics! grounds, that young children represent the same sort of linguistic rules and 
principles as adults (Bloom, 1989; Chomsky, 1986; Hyams, 1986; Pinker, 1984). But if this is 
true, then why are children’s utterances so short? Why is there a 2-word stage at all? One 
answer--in fact, the only one ever proposed--is that while children represent the same sort of 
knowledge as adults, they have problems using this knowledge, some sort of processing 
bottle-neck. To put it another way, the only way to coherently hold on to the view that 
children represent adult-like grammars is to suppose that what they say is not an adequate 


1] am grateful to Jane Grimshaw, Steven Pinker, Virginia Valian, Ken Wexler, and Karen Wynn for their very 
helpful comments. This work was supported by a National Science Foundation Graduate Fellowahip. Send 
correspondences to: Paul Bloom, Department of Brain and Cognitive Sciences, £10-105, Massachusetts Institute of 
Technology, Cambridge, MA 02139. E-Mail: bloom@psyche.mit.edu 
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reflection of what they know. 
On the empirical side, there is considerable evidence for processing limitations in child 


language. For one thing, length limitations show up even when children imitate adult 
speech. The length of a young child’s imitation of an adult sentence is not predicted by how 
long the adult sentence is, but rather by how long the child’s spontaneous utterances tend to 
be (Brown and Fraser, 1963). This hints that the reason children’s utterances tend to be 
short has nothing to do with their grammars, but is due to a general inability to utter long 
strings of words. 

Further, children omit not only subjects, but also direct objects, indirect objects, verbs, 
locative arguments, and so on. In fact, much of the early debate over missing constituents in 
child language concerned sentences without verbs (Bloom, 1970; Bowerman, 1973; Braine, 
1974; Brown, 1973). As Brown (1973) notes, often children appear to be producing two- or 
three-word subsets of longer sequences. Instead of saying J put the book on the table, a child 
might say J put or put book or put table, and so on. The most parsimonious explanation of 
such utterances should account for all the omissions in child language--not just missing 
subjects. 

Finally, Mazuka, Lust, Wakayama, and Snyder (1986) point out that some children go 
through a stage where they neither include the subject nor do they omit it. Instead, they 
reduce it to a schwa (see Bloom, 1970). This would follow if children have difficulty uttering 
subject NPs but know they are required and thus make some effort to produce them. This 
behavior is entirely mysterious from the standpoint of the pro-drop hypothesis, which 
predicts that children will either include the subject or omit it. 

While all of this is suggestive, it hardly makes for a knock-down argument in favor of 
the processing theory. Therefore it becomes interesting to try to compare the pro-drop theory 
and the processing theory more directly. One way to do this is as follows: 


Syntactic complexity and subjectiess sentences 

If subjects are omitted because of processing difficulties we would expect them to be 
omitted more frequently from longer structures than from shorter ones. Therefore, the 
subjectless sentences that children produce should tend to have longer VPs than their 
sentences with subjects, because long VPs exert more of a processing load than short VPs. 

This prediction was first tested by Bloom (1970), who studied a 22-month-old child’s 
use of one verb -- make. She predicted that subjects should be omitted more frequently with 
long VPs, so a child would be more likely to oiit the subject if the VP was something like 
make me a cake, than if it was make cookie. Bloom found 45 sentences with the verb make, 
13 with subjects and 32 without. The mean lengths of the VPs were 2.77 and 3.25 
respectively, a significant difference (p < 0.05, one-tailed). 

A few years later, Braine (1974) performed the same sort of analysis on the 
spontaneous speech of two children, one acquiring English (Jonathan), the other acquiring 
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Hebrew (Odi). He found no significant differences in the mean length of VPs for subjectless 
sentences versus sentences with subjects. He concluded that there is no length limitation on 
children’s language production, which runs counter to Bloom’s finding and apparently 
refutes the processing theory. However, there are certain problems with Braine’s study that 
make accepting his conclusion premature. 

First, some of the utterances that he counted as VPs did not actually include a verb, 
but were instead “marked by the content of the utterance as including actions”. It is not 
clear whether these should have been included. Presumably many of them--particularly 
those that were only one word long--did not actually require 3 subject at any 
representational level. Therefore including such utterances might have spuriously lowered 
the mean length of sentences classified as “subjectless VPs". 

A related problem is that Braine included requests, statements, and questions in his 
analysis of Jonathan's speech and statements and questions in his analysis of Odi’s speech. 
But some requests and questions do not require subjects, such as give that to me/ and want a 
cookie?. As such, they have a different status than VPs where the subject actually has to be 
there and are irrelevant to both the pro-drop hypothesis and the processing hypothesis. 

Finally, adult Hebrew does allow for null subjects in some contexts, and therefore some 
of Odi’s subjectless sentences may actually be pro-drop utterances. None of the arguments 
against the pro-drop hypothesis concern children’s subjectless sentences in languages where 
such sentences are acceptable; the interesting debate is over the status of subjectless 
sentences that are unacceptable in the adult grammar. 

In light of these problems, I decided to do an analysis similar to what Braine did, using 
a broader data base and controlling for the problems mentioned above. 


Analysis 

Subjects 

The subjects were three children studied by Brown (1973): Adam, Eve, and Sarah. 
Transcripts of their speech are stored in computer text files as part of the CHILDES data 
base (MacWhinney and Snow, 1985) and a computer search program was used for all 
analyses. Adam’s speech was studied from 10 2-hour samples taken from the ages of 2;3 to 
2;7, Eve's speech was studied from 10 2-hour samples taken from the ages of 1;6 to 1;10, and 
Sarah's speech was studied from 20 1-hour samples taken from the ages of 2;3 to 2;7. 
Procedure 

The hypothesis is that children’s subjectless sentences will tend to have longer VPs 
than sentences with subjects. One necessity when doing such an analysis is to exclude 
subjectless sentences that are in fact acceptable in the adult grammar, such as imperatives 
and some questions. Because of this, only utterances with two types of verbs were used. 
These were (i) past-tense verbs, which cannot be used as requests or imperatives (e.g., 
wanted), and (ii) verbs that denote cognitive states or involuntary acts (e.g., need). This 
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second type will be called “non-imperatives”, since they can almost never appes: in the 
imperative form. There were & total of 48 past-tense verbs and 20 non-imperatives used in 
the search.? 

For the analyses below, questions, statements with no or don’t, statements where the 
verb is part of an embedded clause, and rote imitations of adult speech were not included. 


Results 

Each child’s utterances were analyzed separately for the two verb types, through 
one-tailed t-tests comparing the VP-length between sentences with and without subjects (see 
Figure 1 at the end of the paper). In all cases but one, the difference was statistically 
significant. The exception was Sarah [Past-Tense Verbs], although the difference was in the 
right direction, it was insignificant (possibly as a result of the low sample size). When the 
two verbs types were counted together, however, there was a significant effect for each child, 
including Sarah. These results strongly co.:firm the predictions of the processing theory. 


An alternative theory of the length difference 

There is another explanation of the length difference that is worth considering, one 
consistent with ‘!:e pro-drop hypothesis. Children may omit a subject only when they believe 
its meaning can be inferred by the listener from context. If long VPs supply more of the 
relevant context than short VPs, this would explain why subjectless sentences tend to have 
longer VPs than sentences with subjects. 

We can compare this explanation and the processing account in the following way. 
Suppose some of the children’s utterances have long subjects (e.g., the big mean lion) and 
others have short subjects (e.g., you). The processing account predicts that the former class 
of sentences should have shorter VPs than the latter, since a long subject imposes more of a 
processing load. In sum, we would predict a gradual decrease in the length of the VP as a 
function of subject size, as shown in (2). The pragmatic hypothesis, in contrast, predicts no 
difference between overt subjects of different . rgths, so long as they all have unambiguous 
reference. Thi is shown in (3). 


(2) Processing theory--predictions about VP-length 
no subject > short subject > long subject 


(3) Pragmatic theory--predictions about VP-length 
no subject > short subject = long subject. 


Unfortunately, children at the ages where they omit subjects rarely produce subjects 


*These were taken from an exhaustive list of verbs previously compiled from the speech of Adam, Eve, aud Sarah 
by Michelle Hollander, as part of an unrelated study. 1] am gratefii) to her for providing them to me. 
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that are more than one word long. But we can compare the theories by contrasting pronoun 
subjects with non-pronoun subjects. Since pronouns are phonetically shorter than 
non-pronouns, the processing theory predicts some difference in VP-length as a function of 
whether or not the subject is a pronoun. As long as both the pronoun and non-pronoun 
subject are unambiguous, no such prediction would come out of the pro-drop hypothesis. 

The prediction was tested using the data compiled above. Since it is important that all 
subjects in this analyses be unambiguous, the only pronouns included were 7 and you, since 
pronouns like she or they are often ambiguous and could require a longer VP because of this. 
The analysis was done collapsed over verb types; the results are shown in Figure 2 at the end 
of the paper. 

The length of the VP clearly decrease as a function of the size of the subject. Contrast 
analyses testing for a linear trend in VP-length as a function of subject size showed a 
significant effect for each of the three children. 


Why are subjects omitted more frequently than objects? 

Finally, I want to briefly consider the question of why subjects are omitted more 
frequently than objects. Across the three children, 55% of their declarative sentences have 
m‘ssing subjects. In order to calculate the proportion of missing objects, we have to look only 
at contexts where verbs must take an obligatory object. This can only serve as an estimate, 
because it’s not at all clear whether the adult intuition about which verbs take obligatory 
objects is going to be the same as the child's. 

Nevertheless, when we do the analysis, it turns that children omit the object a total of 
9% of the time, which is surprisingly high according to some accounts, but also significantly 
different from the proportion of subject omission. Every child omitted objects some of the 
time, and every child omitted subjects more frequently than objects (see (4)). 


(4) Omission from obligatory contexts 


Adam Eve Sarah Total 
SUBJECTS: 57% 61% 43% 55% 
OBJECTS: 8% 1% 15% 9% 


If the subject/object difference is due to a processing asymmetry, we should expect to 
find other differences between subjects and objects. For one thing, given that pronouns don’t 
exert much of a processing ijoad, we would expect them to be more frequent in subject 
position than in object position. This seems to be the case -- for each ch‘ld, there is a greater 
proportion of pronoun subjects than pronoun objects (see (5)). Another prediction is that 
non-pronoun subjects will be shorter in length than non-pronoun objects, a difference that 
also occurs (see (6)). When we sum up over the three children, both of these differences are 
highly significant. 
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(5) Proportion of overt NPs that are pronouns 


Adam Eve Sarah Total 
SUBJECTS: 41% 36% 91% 51% 
OBJECTS: 25% 14% 33% 24% 


(6) Mean Length of non-pronoun NPs 


Adam Eve Sarah Total 
SUBJECTS: 1.18 1.26 1.00 1.16 
OBJECTS: 1.43 1.63 1.48 1.59 


It’s worth stressing that these are all independent analyses; just because subjects are 
omitted more frequently than objects, it does not follow that pronominal subjects will be 
more frequent than pronominal objects, or that non-pronoun subject N' will be shorter than 
non-pronoun object NPs. In fact, these two other differences are a mystery from the 
standpoint of the pro-drop hypothesis. The most natural way to explain all three effects is in 
terms of processing load; there are more resources available for the end of the sentence than 
for the beginning. As a result of this processing asymmetry, subjects are omitted more 
frequently than objects, pronouns are more frequent in subject position than object position, 
and subjects tend to be shorter than objects. 


Discussion 

Once we have an alternative explanation for why children omit subjects, there is no 
independent reason to hold onto the pro-drop hypothesis. In fact, the position that children 
acquiring English represent pro-drop grammars until they are about two-and-a-half or three 
leads to a host of problems. For one thing, you need some sort of account of why the child 
goes so long without switching to the adult: grammar. Some the-..ists appeal to neural 
maturation or “selective attention” as explanations for why the pro-aiop stage lasts so long. 
While these proposals are logically possible, they are ad hoc, and have little independent 
support. Furthermore, there is the problem of determining exactly what information causes 
the pro-drop to non-pro-drop shift. To date, none of the proposals of what causes the 
parametric switch have met with convincing empirical support. 

Finally, the alternative view, which is that all children start off with non-pro-drop 
grammars, runs into none of these probiems. Under this theory, children initially represent 
overt subjects as obligatory (ay in English) and only when hearing subjectless sentences do 
they change their grammars to pro-drop (as in Italian). It turns out that 2-year-olds 
acquiring Italian omit subjects far more frequently than 2-year-olds acquiring English 
(Valian, 1989), which suggests that the switch from non-pro-drop to pro-drop takes place 
very early in the development of a child learning a language like Italian. 

The hypothesis that all children initially represent pro-drop grammars has led to some 
very interesting theoretical and empirical speculation. However, the data fail to support this 
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hypothesis. Instead, it appears that children acquiring English omit subjects because of a 
processing limitation on language production and that all children initially represent 
non-pro-drop grammars. 
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ACQUIRING LANGUAGE IN A CREOLE SETTING: 
THEORETICAL AND METHODOLOGICAL ISSUES 


Lawrence D. Carrington 
University of the West Indies 
St. Augustine 


The challenge of acquisition in Creole environaents 


Creole environaents offer an opportunity for the study of language 
acquisition in settings which contrast strongly with those in which 
mainstream thought in language acquisition studies has been establish- 
ed. In Creole environments, the target of the learner is ill-defined 
both because of the intensely variable nature of the input and because 
of the absence of exactly pertinent grammatical descriptions. These 
circuastances present a theoretical and aethodological challenge to the 
analyst, viz. the deteraination of the true target of acquisition, the 
nature of acquisition in the face of such a variable environaent as 
well as the internalization of control of seaningful variation by a 
learner. This paper will elaborate on the challenge and suggest a 
method for developing a corpus for study in such environments. 


The usual environment for acquisition studies 


Although variation is present in all language acquisition sett- 
ings, it has not been a purposefully included conditioning factor of 
the sainstreaa study of language acquisition. Orthodox knowledge of 
language acquisition has been established by studying children who are 
exposed to a limited nusber of previously described language varieties 
(and preferably one) acdelled by formally educated aainstreas users. 
The zhild is supposed to be acquiring a specified language for which an 
asple referential description is available. These descriptions have 
tended to be of the static type in which variation is a footnote rather 
than a detereainant of the description. By contrast with Caribbean 
sociolinguistic complexes, such homogeneous environsents aay be teraed 


sterile. 


Types of learning environaents in the Caribbean 


The Caribbean sociolinguistic complex is a rich environaent which 
obliges the analyst to cope with variation in auch the sage way as the 
child learner. Several types of sicro-settings @ay be identified. 


Type I. Consistent aonolingual The consistent eaonolingual 
environaent is the classic monolingual environgaent and it is atypical 


of the Caribbean. Speakers would interact in the presence of the 
learner and with the learner in a single code. Shifts of register, 
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style or situational variety would be linguistically within a single 
graseatical systea. In strict teres, available audio-visual aedia 
would be in the variety used by the speakers in the accessible envircn- 
ment. 


Type IJ. Leaky monolingual Much sore coason would be the leaky 
gonolingual envirensent in which those who interact with the learner 
produce a single code, but the learner is exposed in addition to 
another code from audio sources lacking the physical presence of a 
speaker or other pragmatic context for interpretation. 


Type If}. flaonolingual with secondary input In the third kind of 


envircnaent, aonolingual with secondary input, another code is present 
in the environsent and is supported by prageatic contexts although the 
jsaediate caregivers do not thesselves use it. The isportant differ- 
ence between this setting and the Type II setting is that the secondary 
language is overheard in contexts that have pragesatic support. This 
type of environment is auch aore prevalent than the Type I environaent. 


Type IV. Special case in sulti-code environment The fourth type 


of micro-setting may be summarized as one in which the learner is a 
special addressee. The iesediate socializers have sore than one 
variety available but use only one with the learner in keeping with a 
household decision about what language they wish the learner to 
acquire. 


Type V. Routine case in aulti-code environment In the fifth 
case, the routine case, the learner is addressed in only one of the 


available codes of the socializers in keeping with a general coanunity 
convention Cas apposed to the househola decision of type IV) tha* a 
specific variety is the appropriate one for use with children. 


Type VI, Qgen access In the open access case, the socializers 
command sore than on. code and the learner is not excluded from any of 
thea. The difference between Type VI and Type V aay be isportant at 
later stages of acquisition when reported asyasetries in child-parent 
comgaunication have the effect of obliging children to use varieties 
closer to the standard than those used by their parents. 


Obviously, one can study acquisition in any setting but the high 
frequency and comsonplace nature of the Type V/VI environaents recos- 
aends thes as prisary for study in the Caribbean sociolinguistic 


complex. 


Linquistic repertoire vs Language 


Within the above eicro-settings, several different factors aay be 
responsible for variation including the existence of a creole dialect 
continuus and the practice of code switching. The notion of a creole 
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dialect continuya has dominated the literature on Caribbean speech 
varieties for nearly 2 decades. Its characterization, analysis and 


exeaplification in the work, inter alia, of De Camp (1971), Bickerton 
(1975) and aost recently Rickford (1987) attests to the virtual 
inevitability of variable data in Caribbean environaents. Current 
continuus theory treats drift across lectal boundaries within a aulti- 
dimensional sociolinguistic space which is presented as a single 
interlocked systea. Variation is intrinsic to such a construct. 


Code switching in response to established social cues can be noted 
as a contributor to the variation that is characteristic of the 
Caribbean sociolinguistic complex. Code switching aay also result froa 
a speaker’s inability to complete a comsunication in a given code 
because of a break in coapetence. Thus, although a conventional 
analysis of a speaker’s behaviour aay assign parts of his perforeance 
to different language systeas, his speech behaviour say constitute a 
single syster of comaunication within the relevant Caribbean society. 


In these Circuastances, the Salience of variation challenges the 
notion that a learner is acquiring a_lanayage, a pre-existent entity. 
A large propor’ion of children acquire a speech repertoire that aay 
include varieties that cannot be unequivocally ascribed to a single 
language. Rather than acquiring a__language, these learners would be 


acquiring a Jinquistic repertoire that allows them to interact within 
their societal range. We therefore need to focus on the idea “linguis- 


tic repertoire” rather than the idea “a language”. The point is 
critical for shaping relevant field procedures and analytical process— 


ese 


Pinning down the variable target 


The first challenge for the study of acquisition in @ Caribbean 
socio-linguistic complex is the detersination of the target of the 
child given that variation is present and influential at the level of 
the individual household. Information from the environaent of an 
inforraant P illustrates the extent to which a Single household can 


offer variable exposure and output. 


Recorded between 2;8 and 330, P is the second youngest of 7 
children in a household. Her mother CJJ, a teenager, her grandaother 
C43, alaost 40, and her grandmother's husband CRI, late 40s, show very 
different varieties of the spectrua of possible speech in rural 
Trinidad. Her grandaother had secondary level education up to the Sth 
Fors (approx age 163 in a semi-urban settings her granJaother’s husband 
has had limited elementary schooling in 2 rural setting; her aother 
left secondary school in their village setting at Fora 2 (approx age 
133 four of the other children in the household are at school. 
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Exasples of the variation to which the learner is exposed are 
presented in the appendix. Drawn from the sage 30 ainute recording, 
these exasples of input and aebient language illustrate variation in 
pronoginal selection, tense-aspect aarking, and fora of a locative 
question word in the speech of the child’s grandfather, grandaother and 
aother. In an. environaent of such diversity, it becoses iaportant to 
be able to identify the direction of movement of the inforaant. 


Let us assuee that within the household Concept C is expressed by 
variants Vs, Va, Va «-- Vn (where the variant is a fora, eleaent or 
structure). The learner at some initial point say be recorded as 
producing a form F:. If we are to track sovemsent towards a target we 
need to ascertain which of variants Va ... Va is being represented by 
F,- It will be frequently easy, sometiees difficult and at other times 
impossible to tell. However, it is pertinent and isportant to atteapt 
to determine which variant the household considers that the learner is 
targeting or ought to be targeting, since it is ultiagately their 
response to her output that will contribute to its realignaent in the 
socially acceptable direction. 


Working without a qraamatical description 


The second challenge for the study of acquisition in such environ— 
sents arises froma the fact that no existing descriptions of the systess 
of comaunication would allow the analyst to have a predeterained 
reference point for the target of the learner. The disadvantage of 
this circuastance is aore apparent than real. The fact is that there 
can be no valid description of a learner’s target unless that descrip- 
tion is derived from the interactions of the learning environeent. A 
corpus created in accordance with that principle would have the charac -— 
teristics of being a valid representation of actual input and aabient 
language, permitting focus on the relationship between fora and 
function as the child perceives it, and ensuring that judgsents of 
grammaticality and acceptability are based on data actually available 
to the learner rather than on a graasar that is hypothetical as far as 
the learner and his/her iasediate environaent are concerned. 


This position dees not deny the value of the already published 
analyses of Caribbean language varieties; it places thea in a different 
perspective, a function of ultisate reference rather than assumed 
target of the learner, The true target of the learner can be defined 
and described realistically only by exsaining the available rather than 
the purported input and ambient language. The corpus for study wouid 
be created by the recorded interaction between the learner and the 
participants in his exposure to comaunication. Both the description of 
the target comaunication system (i.e. the community language) and the 
description of the acquisition process must come fros those data. 
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Satisfactory acquisition is the fulfilleent of noras expected by 
the environaent; hence, it is necessary to determine which variants the 
household wcnsiders that the learner is targeting or ought to be 
targeting. In this regard, the participants in the interactions 
recorded with the child can be invited to state what they consider the 
learner to have said and thus provide access to their perception of 
grameaticality. This procedure would be Similar to the elicitation of 
repairs which has been applied to other purposes including tests of 
children’s understanding of various concepts and structures. In this 
instance, it can provide knowledge on the expectations of speakers of 
the target varieties and allow establishaent of one aajor criterion of 
satisfactory acquisition. In addition, it would enhance our knowledge 
of variation by providing indices of the expectations of the adult 
users of the systea of communication. 


Acquisition of acceptable control of variation 


In Caribbean sociolinguistic complexes, variation is not only 
diagnostic of speaker history but also functional within cosgs@unication 
acts. Hence, part of what a learner needs to acquire is control over 
socially appropriate variation. Children acquiring language in a 
Caribbean sociolinguistic complex need to learn the different sets of 
linguistic behaviours that are acceptable within the same conaunity 
context. This aay not differentiate these environments froa other 
learning contexts but there is a potentially aore compelling considera- 
tion that aakes attention to this detail iaportant. 


Qf special relevance to Caribbean sociolinguistic complexes is a 
conclusion of Le Pagr and Tabouret-Keller (1985) on the evolution of 
speech behaviour and self-identity which can be paraphrased as follows: 


Individuals create their system of verbal behaviour to 
reseable that of the groups with which they wish to identify, 
subject to the constraints of their ability to identify the 
groups, the strength and clarity of their sotivation, the 
adequacy of their opportunities for learning and their 


ability to learn. 


Their work adequately supports this view and it is clearly 
‘yelevant to choices in variable socio-linguistic space. One cannot 
study language acquisition in these environsaents without including the 
acquisition cf variable behaviour and of knowledge on tolerance of 
difference within the speaker’s ismediate environment. The child 
learns how to raintain group meabership, and how to sanipulate variants 
without threatening desired relaticnships. It is these skills which 
control the shift behaviour that has been discussed repeatedly in 


respect of continua. 
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Summary 


The study of language acquisition requires that the analyst 
identify the learner’s target and have available a grammatical descrip- 
tion of the target. In the case of the Caribbean Creole environaent, 
special caution is required in identifying the learnrer’s target because 
substantial variation is intrinsic to the input and ambient language. 
Existing studies of language varieties in the region aust not be 
assused to provide appropriate descriptions of the input or ambient 
language for a particular learner. Both the description of the 
language to be acquired and the account of the process of acquisition 
aust be based on the saee corpus produced in the learning environment 
by the learner and the participants in his/her language socialization. 
The corpus can be enhanced by eliciting from the participants their 
notion ef what the learner is atteapting to say and ought to be saying. 
This procedure is iaportant for providing access not only to notions of 
gramaaticality but also to notions of appropriateness. Given the 
interlock between speech behaviour and identity in these settings, 
attention has to be devoted to the acquisition of ability to sanipulate 
socially linked variants of a variable. 


REFERENCES 


Bickerton, Derek 1975 Dynamics of a Creole Systen, Cambridge: 


Caabridge University Press. 


De Caep, David 1971 ‘Toward a generative analysis of a post-creole 


continuum’. In D. Hymes, Pidginization and_crealization of 
languages, Casbridge: Cambridge University Press. pp 349-370. 


Rickford, John 1987 Disensions of a Creole Continuus: Historye Texts. 


& Linguistic Analysis cf Guyanese Creole, Stanford: Stanford 


University Press. 


78 


70 


: sa . 7 . : z ; - Pe x : 7 - Z = 
. La " a gb bee se os ? a bo oe 


A 


— 


71 


APPENDIX 


Exaaples of variation in input and asbient language 


Age of child 238 Bate 4th February 1988 


Feature: Pronominal fora 


Utterance # Speaker Utterance 
13 grandfather 2 git tex rt of fraw fi bike 
(general, about bo Gu bad. 
child, in pres~- I’1] take it away from her 
ence of child) because she’s too naughty. 

16, 17 grandaother ick «a Kamin, let a kaw 2a, 
(to child in ref- Let her come in, let her 
erence to sibling) come in. 

138 | qrand father " 
(to child, message tel ha Avk (ud fr wri, 
to be canveyed to Tell her to cook one for ae. 
aother) 

244 grandfather tel $1 Kam owt! 


(to child, in ref- Tell her to come out. 
erence to sibling) 


Feature: WH locative 


Utterance 8 Speaker Utterance 
67 sother We iw gorn: 
(to child) Where are you going? 
en wt vw Aipin jw > 
80 grandfather i“ wetlpel J jong” Krean ° 
{to child) And where would you keep = your 
crayons? 


Feature: Tense-aspect sarking 


Utterance # Speaker Utterance 
> 
23 grandeother hu gev jX dat prif{a: 
{to child? Who gave you that picture? 
30 grandaother hi giv Ju at tfe a 
(to child) Who gave you the chair? 
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CHILDREN’S KNOWLEDGE OF RELATIVE SCOPE IN CHINESE 


Yu-Chin Chien Kenneth Wexier 
California State U. San Bernardino Massachusetts Institute of Technology 


In this paper, we present some results of an experiment which was 
designed to investigate how Chinese children and adults would interpret 
sentences containing universal quantifiers and existential quantifiers 
such as the examples given in (1) to (4). €(1) & (2) are canonica} 
constructions; (3) & (4) are Chinese bg-constructions.] 

(1) Draw a/every figure in every/a box. 

(2) In a/every box draw everv/a figure. 

(3) Ba a/every figure draw in every/a box. 

(4) Ba a/every box draw every/a figure. 

According to current literature (c.f., J. Huang,1982; Lee, 1986; Aoun 
& Li. ta appear), the scope facts concerning two quantifiers ‘n a simple 
sentence do not hold across languages. In English, it is generally 
agreed that the scope relation of quantified noun phrases is free within 
the minimal sentence and thus allows various scope ambiguities. However, 
in Chinese, no such free property is attested. In many cases, Chinese 
sentences are strictly unambiguous. This cross-linguistic difference in 
scope relations is exhibited by the English example given in (5) and its 
Chinese counterpart given in (6). 

(5) Every child sang a song. 
(6) Meige xiaohai dou chang le yishou ge. 
every-CL child all sing Asp. one-Cl song 
"Every child sang @ song." 
Sentence (5) “every child sang a song” is two-way ambiguous. It can mean 
that different children sang different songs where the universal 
quantifier NP (Q-NP) “every child" takes wide scope over the existential 
quantifier NP “a song". This sentence can also mean that the children 
all sang the same song where the existential Q-NP “a song" is said to 
take wide scope over the universal Q-NP “every child”. The equivalent 
Chinese sentence given in (6), however, has only one meaning. It 
corresponds to the wide scope reading of the universal Q-NP “meige 
xiaohai”. Namely, different children might sing different songs. 

A rule of Quantifier Raising [as given in (7)] and two general 
conditions on Quantifier Raising were proposed by May (1977, 1985) to 
explain the scope ambiguity of sentence (5) and many other 
quantificational sentences. He argued that there was a level of Logical 
Form (LF) in syntax where generalizations concerning quantificational 
phenomena such as scope relations could be captured. In an LF 
representation, if one quantified NP c-commands the other quantified NP 
then the c-commanding one takes wide scope over the c-commanded one. The 
notion of “c-command“ may be cee in the way stated in (8). 

7) Quantifier Raising Rule: 
Chomsky-adjoin a quantificational NP to S. 


A c-commands B iff A does not dominate B and the first branching 
node which dominates A also dominates B (c.f., Reinhart, 1976). 
By applying the rule of Quantifier Raising (QR) to the S-structure 
representation, the two quantified NPs in sentence (5) could be freely 
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moved to adjoin to the S$ node, successively. Since there are no other 
conditions on the application of the QR rule nor other constraints on the 
output after applying this QR rule, an S-structure Tike (5) can be 
transformed into two well-formed LF representations as given in (9) and 


10). 
7 (9) [Every child], [[a song], [x, sang XIe]g}s] 


(10) {{A song), {{every child], [x, sang XAg}sls] 


In the LF resresentation (9), the universal Q-NP “every child" 
c-commands the existential Q-NP “a song". The universal Q-NP thus takes 
wide scope over the existential Q-NP and thus implies that "different 
children might sing different songs". In (10), on the other hand, the 
existential Q-NP “a song" c-commands the universal Q-NP “every child". 
The existential Q-NP thus takes wide scope over the universal Q-NP and 
implies that “the children all sang the same song”. 

There exist at least three proposals expressly designed to explain 
the scope facts of Chinese sentences. Based on a wide range of data, 
Huang (1982) examined quantifier scope in Chinese. He claimed that while 
the rule of Quantifier Raising and the notion of structural c-command 
were both relevant in the determination of scope relations in Chinese, 
the application of the QR rule in Chinese was not as free as that 
observed in English. In order to interpret the scope phenomena of 
Chinese sentences and to explain the contrast between Chinese and English 
as shown in examples (5) and (6), Huang proposed a general condition on 
scope interpretation for Chinese (1982: 220). This general rule [also 
known as the Isomorphic Principle], as given in (11), states that, for 
Chinese quantificational sentences, if a quantified NP A c-commands 
another quantified NP B in Surface-Structure, this quantified NP A will 
also c-command the quantified NP B in Logical Form. In other words, 
throughout the process of SS to LF mapping performed by the QR rule, the 
c-command relationship between two quantified NPs at S-Structure is 
preserved at Logical Form. 

(11) Huana’s General Condition on Scope Interpretation in Chinese 
“Suppose A and B are both QPs or both Q-NPs or Q-expressions, 
then if A c-commands B at S-Structure, A also c-commands B at 
Logical Form” 

Consider the Chinese sentence (6) again. At S-structure, the 
universal Q-NP “meige xtaohai (every child)“ c-commands the existential 
Q-NP “yishou ge (one song)". According tc the general constraint stated 
in (11), the c-command relation between these two quantified NP will stay 
the same in LF (after the application of the QR rule). As a result, 
sentence (6) is not ambiguous. 

A slightly different proposal was made by Lee (1986). Following 
Huang’s proposal, Lee argued that the hierarchical relation between two 
quantified NPs in a sentence was relevant for the determination of scope 
relations in Chinese, but instead of “c-command", the relevant 
hierarchical relation should be expressed in terms of “g-command". In 
addition, Lee claimed that both the notion of linear order and that of 
hierarchical relation, namely g-command, were relevant to scope 
interpretation in Chinese. By incorperating these two notions, Lee 
revised Huang’s general condition for scope interpretation in the 
following way [see (12)}: 

12 ‘s Gen ondit In r j 
Given two quantified NPs A and B 
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i) if A asymmetrically g-commands B, A will have scope over B; 
ti) if A and B g-command each other, then A can have scope over 
B only if A precedes B (1986: 187). 
[A g-commands B iff the node representing the governing 
category of A dominates B, where "governing category for an 
element A" is defined as "the minimal category that 
contains A and a SUBJECT".] 

In (6), the governing category for the universal Q-NP “every child" 
is the whole sentence which also serves as the governing category for the 
existential Q-NP “a song". Accordingly, these two quantified NPs 
g-command each other, and therefore the preceding NP “every child” takes 
wide scope over the succeeding NP “a song". Since in Chinese sentences 
such as (6), the hierarchical order of c-command is confounded with that 
of g-command and linear precedence, additional data besides sentence (6) 
are required to evaluate these two analyses suggested by Huang and Lee. 
[We postpone the discussion of t*e related data until] the experimental 
design and the cutcomes are exarsined. ] 

Another analysis which was proposed to account for the scope 
phenomena of Chinese was introduced by Aoun ard Li (to appear). They 
challenged Huang and Lee’s Isomorphic Principle by showing that there 
were instances in Chinese which did not exhibit this isomorphic effect. 
We will not discuss Aoun and Li’s proposal in detail, but just want to 
point out that their analysis and Huang’s analysis predict exactly the 
same results concerning the sentences tested in our experiment. In order 
to differentiate Aoun & Li’s theory from Huang’s theory, one has to 
consider Chinese passive constructions. 

The purpose of our 2xperiment was to get preliminary evidence on 
whether Chinese chiluren understand scope reiations and whether they know 
which relations are possible for particular syntactic configurations. 

For the most part linguists agree on the judgments of scope relations. 
However, in the one case (tested in our experiment) where Lee’s mode] 
disagrees from Huang’s there appears to be not complete agreement on the 
adult judgments. Therefore, in addition to child subjects we a?so tested 
adult subjects. Their judgments regarding this one controversial case 
were carefully examined. Moreover, by examining adults’ data, the 
validity of the experimental method was assessed. Since the relationship 
between behavior and scope interpretation is particularly comp] icated 
(see our later discussion on “accidental” narrow scope), adult data is 
particularly useful when studying scope. For an earlier discussion of 
the acquisition of scope in Chinese, see Lee (1986). 


EXPERIN 

In the experiment, an act-out task was used to test Chinese-speaking 
children and adults’ interpretation of sentences involving two 
quantificational NPs. The subject was first presented with a sheet of 
paper with an array of three equally sized squares and a card with an 
array of three di‘ ferent figures (or numbers), or a set of three markers 
of different colors. The subject was then presented with a test sentence 
(e.g., “Draw every figure in one box") and asked to perform the action 
prescribed in the presented sortence. An example of the jayout of the 
experimental materials is illustrated in (13}. 
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Sixteen different types of experimental sentences were included fn 
this study. Half of the experimental sentences were canonical sentences 
fexamples are given in Table 1 and Table 2]. The other half were 
Ba-constructions such as those illustrated in (3) & (4) above [which wil] 
not be discussed in this paper]. According to their syntactic 
structures, we classified the canonical sentences into two major groups. 
Group 1 consisted of four constructions as shown in (15) & (16) [see 
Table 1] and (21) & (22) [see Table 2]. These four constructions shared 
the structural representation given in (14) [which is repeated in (20)]. 
Group 2 consisted of another four constructions as shown in (18) & (19) 
[see Table 1] and (24) & (25) [see Table 2}. They shared the structural 
representation illustrated in (17) [which is repeated in (23)]}. 

Let us first examine the structural configuration given in (14) 
[=(20)]. Following Huang’s analysis (1982), if we assume that c-commaid 
can be relaxed to allow for an NP object of a preposition to c-command 
across a dominating PP node, then, in (14), NP, c-commands NP,, but 
not vice versa. According to Huang’s scope principle given in (11), NP, 
should take wide scope over NP,. In this configura*ion, NP, and 
NP, share the same governing category, namely the S node. Thus, 
arcording to Lee, these two NPs g-command each other. Applying Lee’s 
scope principle (given in 12) to (14) then, the preceding NP (i.e., NP.) 
should have wide scope over the succeeding NP (i.e., NP). As mentioned 
earlier in this paper, Aoun and Li’s analysis makes the same prediction as 
Huang’s analysis for sentences with structure (14). According to Kuang 
and Aoun & Li, NP, (i.e., y box) should take wide scope over NP, 

(i.e., x figure). Lee’s analysis makes the opposite prediction. 

Now consider the structural configuration given in (17) [=(23)]. In 
this structure, NP, c-commands NP,, but not vice versa. According 
to Huang, NP, takes wide scope over NP,. With regard to the notion 
of g-command, again, NP, and NP, g-command each other. In this 
case, since NP, precedes NP,, according to Lee, NP, should 
have wide scope over NP,. Considering the structure given in (17), a 
converging prediction may be derived via all three analyses mentioned. 

As can be seen from the examples given in Table 1 and Table 2, besices 
the configurational factor, we also varied the types of quantified NPs 
occupying the two objecc positions in each sentence. We included two 
types of quantified NPs in this study: the universal Q-NP such as “every 
box" or “every figure” and the existential Q-NP such as “one box" or “one 
figure”. In some sentences, the two quantified NPs were of the same type 
(e.g., sentences in Table 2); in other sentences, these two NPs were not 
of the same type (e.g., sentences in Table 1). Taking the order of the 
two quantified NPs into account, four possible combinations of these two 
types of quantified NPs were established: the “every-every” condition, the 
“every-one” condition, the “one-every” condition and the "one-one" 
condition. In order to facilitate comparisons among these conditions, we 
have included only one set of test sentences as examples here. However, 
jn the real test conditions, three sets of test items were included. [One 
with the verb “hua (draw)" and the direct object NP "tuxin (figure)", one 
with the very "xie (write)" and the direct object “suzi (number)*, and the 
final set with the verb “tu (mark/color)" and the direct object “yanse 
(color)".] Addition of the ba-sentences yielded a total of 16 test items 
in each set and a total of 48 test items for each subject. The three sets 
of test items were randomly given to each subject. 

One hundred and ninety-two children between the ages of 3 and 10, and 
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42 adults were tested. The child subjects were randomly sampled from 
preschools and elementary schools in Taipei Taivan. The adult subjects 
were undergraduate students attending National Chengchi University in 
Taipei. The children were divided into seven age groups in one-year 
intervals (e.g., Gl: 3-4...&7: 9-10) with at Teas: 25 children in each 
group except the first two groups. 

fet us first examine our adults’ responses. The results concerning 
the eight types of quantificational constructions are summarized in Table 
1 and Table 2. When adult subjects were asked to "draw one figure in 
every box" [e.g., (15)], about 77% of the time, they assigned wide scope 
(WS) reading to the existential Q-NP "one figure” and drew the same figure 
in different boxes. About 19% of the time, they assigned WS reading to 
"every box” and drew different figures in different boxes. This result, 
at first sight, seems to follow Lee’s prediction but not the one provided 
by Huang or Acun & Li. However, note that a WS reading for “every” does 
allow th: response illustrated in (15a) where the same figure is drawn in 
each box. Nothing about the syntax or the scope assignment makes it 
necessary that a different figure be put in each box. And, in fact, Lee’s 
analysis predicts that no instances of (15b) be found, so it remains a 
question why 19% exist. Thus it seems that the results givan in (15a & b) 
are more consistent with Huang’s or Aoun & Li’s analysis than with Lee’s. 

Now, look at the result listed in (16). When adult subjects were 
asked to “draw every figure in one box", almost all of our subjects only 
assigned the WS reading to the existential Q-NP "one box” and drew al] 
three figures in a certain box. This result, on the other hand, does 
follow Huang or Aoun & Li’s analysis but not Lee’s. 

Let us examine the data listed in (18) and (19). When adult subjects 
received the instruction “In one box, draw every figure", with only very 
few exceptions, they assigned the WS reading to the existential Q-NP "one 
box" and drew all three figures in a certain box. This result is 
compatible with all three analyses provided by Huang, Lee and Aoun & Li, 
respectively. Now consider the data listed in (19). When adult subjects 
received the instruction “In every box, draw one figure", about 50% of the 
time they assigned WS reading to the universal Q-NP “every box" and drew 
different figures in different boxes. About 41% of the time, they 
assigned WS reading to the existential Q-NP "one figure” and drew the same 
figure in all three boxes. This result is compatible with all the three 
linguistic analyses under discussion because wide scope for the universal 
quantifier does not imply that there must be different figures chosen. 

What preliminary conclusions can be drawn from the data just 
examined? The hypothesis of g-command and }inear precedence (hereafter 
the linearity hypothesis), to a certain degree, was not confirmed by the 
data, while the c-command hypothesis was confirmed by the data. The 
experimental method seems to be valid. The question is why most of the 
adult responses in (15) had only one figure instead of three different 
ones. Tt seems possible that when more than one response is consistent 
with the syntactic analysis, the preferred response is affacted by 
non-syntactic factors or performance considerations (as in “backward” 
coreference). 

Let us examine the results of the sentences involving two universal 
Q-NPs [i.e., the every-every constructions listed in (21) and (24)] and 
those involving two existential Q-NPs [i.e., tie one-one constructions 
listed in (22) and (25}}. The only correct interpretation for sentence 
(21) and (24) is to draw all three figures in each of the three boxes. 
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As indicated, about 93% of adult subjects gave the correct interpretation 
to these two constructions. When adult subjects were asked to draw one 
figure in one box, about 96% of the time they drew a certain figure in a 
certain box and left two figures unused and two boxes empty. When they 
received the instruction “in a box, draw a figure", about 76% of the time 
they drew a certain figure in a certain box with two figures unused and 
two boxes empty. About 18% of the time, they drew different figures in 
different boxes. In other words, they assigned the generic reading to 
sentence (25) and interpreted the sentence as the following: “for every 
x, if x=box, in x, draw one y, yefigure". 

Children’s responses to the eight types of quantificational 
constructions are illustrated in the eight figures given in Tables 1 & 
2. As can be seen from the figure under (15), when children were asked 
to “draw one figure in every box", the WS reading was more frequently 
assigned to the existential Q-NP “one figure” than the universal Q-NP 
“avery box" (excepting groups 1 & 2). The response pattern exhibited by 
children, older than 5, follows the same trend observed in adults. The 
response pattern exhibited by children younger than 5, on the other hand, 
does not follow the same trend observed in adults. It should be pointed 
out that, in the present study, a high portion of our young children 
tended to give only one particular response to all the test questions 
they had received. Therefore, the set of data obtained from children 
younger than 5 should be interpreted with caution. 

When children were asked to “draw every figure in one box" (16), 
thoir response pattern, to a certain degree, seems to be different from 
the adults. Adults almost always assign wide scope to “one box", but 
even relatively old children (7 to 9) give almost as many responses with 
the 3 boxes involved as indicated in {16b). We speculate that the 
children might treat the PP in (14) as a sister node to NP,, so that 
NP. and NP, c-command each other. Thus either responses [i.e., 

(iba) & (16b)] will be possible. 

Let us consider the results illustrated in (18) and (19). As 
indicated in the figure under (18), when children received the instruction 
"in one box, draw every figure", most of the time, they assigned the WS 
reading to the existential Q-NP “one box" and drew al] three figures in a 
particular box. Again, children exhibited a response pattern very similar 
to the adults’ (excepting groups 1 & 2). When children received the 
instruction "in every box draw one figure", they assigned WS reading to 
the universal G-NP “every box" more frequently than the existential Q-NP 
"one figure” (excepting G7). Our Group 7 children attributed the WS 
reading to the universal Q-NP almost as frequently as the existential 
Q-NP. This response pattern exhibited by children, once more, follows a 
similar trend observed in adults. 

To summarize, if we look at the responses given by children older than 
5 (i.e., our group 3 to group 7 children), a parallel between children’s 
scone interpretation and adults’ scope interpretation was found when 
.-»tructions with two objects (an universal Q-NP and an existential Q-NP) 
were examined. The particular response distributions were somewhat 
different, but all the children’s behaviors were consistent with the adult 
syntactic analysis except (16) “Draw every figure in one box." Here we 
speculated that the children did not have the same phrase-structure as the 
adults. [Note that the analysis given in (17) is not necessarily the only 
one. Here the PP could be attached higher up. However, there is no 
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possibility that the children would attach the PP under ¥’ because 
complements in Chinese come on the right.] 

Let us examine how Chinese children will interpret similar 
constructions with two universal or two existential Q-NPs. The results 
are given in Table 2. As mentioned earlier in this paper, the only 
correct interpretation for sentences involving two universal Q-NPs is to 
draw all three figures in each of the three boxes. However, as indicated 
in the figure under (21) and (24), children gave very few responses 
corresponding to this correct adult interpretation. In many cases, they 
drew a figure in a box and another figure in another box until there was 
no figures left and no boxes unused. Our child subjects seemed to know 
the concept of "every N" and tried to establish a relation between the 
members of the two sets of elements mentioned. Nowever, instead of making 
one universal Q-NP enter the scope of another universal Q-NP, they 
assigned “sum of plurals" readings to (21) and (24). For example, a sum 
of plurals reading for (21) corresponds to the following statement "draw 
three figures in three boxes such that each of the figures is drawn and 
each of the boxes is drawn in." Az alternative interpretation to this set 
of results is that children may produce this strikingly different result 
because of some kind of response set, namely, they did not want to use any 
figure more than one time. Our intuition is that this is an unlikely 
interpretation, especially given the older age at which children still 
produced this result. Obviously future research is necessary. However, 
if this response pattern is upheld, and is seen to be a result of 
children’s syntactic knowledge (and not an artifact) then’ it seems that it 
is an impor’ant empirical discovery which calls out for theoretical 
explanatton. It might be central to a discussion of the acquisition of 
operators in child language. 

When child subjects were asked to draw one figure in one box, in most 
of the cases, they drew a certain figure in a certain box and left two 
figures unused and two boxes empty. When they received the instruction 
"in one box, draw one figure", the same response pattern was observed. 
They drew a certain figure in a certain box with two figures unused and 
two boxes empty. Similar to the adult subjects, in some cases our child 
subjects also assign the generic reading to sentences containing two 
existential Q-NPs. However, they did so to both sentence (22) and 
sentence (25), while adult subjects only assigned the generic reading to 
sentence (25) but not sentence (22). A fuller discussion and 
understanding of these and other results awaits further investigation. At 
any rate, we have provided evidence that children understand reijative 
scope and have knowledge of the syntactic considerations which determine 
available scope readings. 
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Table 1 
The Test Sentences ard the Results 
(16) (1?) 
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Craw x figure in y box. In x bex draw y figure. 
PREDICTIONS PREDICTIONS 
Huang: y.box has wide scope over x figure. Huang: xbox has wide scope over y figure. 
Lee: x figure has wide scope over y box. Lee: xbox has wide scope over y figure. 
Aoun & Li: y box has wide scope over x figure. Aoun & Lis x box has wide scope over y figure. 
RESULTS RESULTS 


(15) Hua yige tuxin zei meige gezi li. 


Draw one-Cl figure at every-CL box inside 
“Draw one figure in every box." 
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(16) Kus meige tuxin zai yise gezi Li. 
Draw every-Ct figure at one-Ci box inside 
“Oren every figure in one box." 
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Adult: 0.79% 


Adult: 96.03% 
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(18) Zai yige gezi ii hue meige tuxin. 
At one-CL box inside draw every-Ci figure. 
*In one box draw every figure.* 


(19) 
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RESULTS 


(21) Hus meige tuxin zei mecige gevi li. 
Draw every-Cl figu's at every-Cl box inside 
“Draw every fiqure in every box.” 
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Huang: xbox has wide scope over y figure. 
Lee: abox has wide scope over y figure. 
Aouns & Li: x box has wide scope over y figure. 


RESULTS 
(26) Zai meige gezi if dou hus meige tuxin. 


At every-Cl fox inside all draw every-Ci figure 
“In every box draw every figure." 


“Waa Doe 


Adult: 92.86% Adult: 4.76% 
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AGE GROUP 


(25) Zai yige gezi Li hue yige tuxin. 


At one-CL box inside draw one-Ci figure 
“In one box draw one figure." 


“moO Rom 
Adult: 76.19% 18.25% 
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Problems in the Acquisi:ion of Grammatical Tone’ 


Katherine Demuth 
Boston University 


1, Introduction 

The development of autosegmental phonology (e.g. Leben 1973, Williams 
1976, Goldsmith 1976) represents one of the most important advances in the 
development of phonology since the generative insights of The Sound 
Pattern of English (SPE) (Chomsky & Halle 1968). Yet the field of acquisition 
has been slow to adopt and integrate new perspectives from theoretical 
phonology, much the same as it has been slow to adopt and apply theoretical 
insights in the area of syntax. The present study attempts to address this issue 
by developing an autosegmental account of the acquisition of grammatical 
tone in Sesotho, a southern Bantu language. While this work represents the 
first stages of on ongoing research project, it raises several theoretical 
questions that will hopefully serve as a model for future study in this area. In 
particular it addresses three questions: 1) When/how does the child figure 
out that Sesotho is a tonal, rather than an intonational, stress or accentual 
language? 2) How does the child acquire tonal rules? 3) When do children 
become aware of OCP effects? 


2. The Prosodic Acquisition Problem 

In order to address the Prosodic Acquisition Problem we need to have a 
model or theory of what prosodic systems look like. For the A beg se of this 
paper I will assume a model of Lexical Phonology along the lines of that 
developed by Kiparsky (1982, 1985) and Mohanan (1982, 1986). A model of 
lexical phonology allows us, and presumably the child, to specify where and 
how pitch is assigned. Languages will differ to the extent that they assign 
pitch to various domains (i.e. morae, syllables, words) and at various parts of 
the eit (i.e. stipulated underlyingly, assigned lexically and/or assigned 
postlexically), and to the extent that pitch assignment may interact with other 
linguistic phenomena such as stress. 


‘3 Data collection for this work was supported by ge lee and Social Science 
Research Council (SSRC) grants. Data transcription and an hid has been supported by 
NSF Grant #BSN-8709938 and by the Social Science Research Council (SSRC). 1 would like 
to thank Nick Clements, Julie Croston, Charles Ferguson, Larry Hyman, Michael Kenstowicz, 
Paul Kiparsky, Chuck Kisseberth, Mark Johnson, Will Leben, K. P. Mohanan, Mpiko 
Ntsekhe, Bill Poser, Mpatliseng Ramaema, the Sesotho Tone Working Groups at the LSA 
Summer Institute at Stanford (1987) and at MIT (1988), and members of the Sesoth Tone 
Workshop at Boston University (1988) for providing stimulating discussion and assistance 
relating to this research. 
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The problem for the child is to figure out 1) whether the | age s/he is 
learning is a lexical tonal (e.g. Chinese), stress/intonational (e.g. English), 
accentual (e.g. Japanese, some Bantu languages) or a grammatical tonal 
language (Sesotho and other Bantu languages), 2) what tonal rules apply and 
where they apply (in the lexical or postlexical component, and 3) if there are 
any OCP effects. 


3. Acquisition of Sesotho Tone 

The data for this study come from a monolingual Sesotho-speaking child 
whose spontaneous utterances during interactions with siblings, ts and 
grandparents were sampled at 2;1, 2;6 and 3;0 years.) Only High (), Lowered 
High (+) and falling (“) tones are marked.? A subset of affirmative present 
tense utterances are censidered here. The data are not intended to be 
statistically significant, but rather provide an indication of general tendencies 
in the child's developing prosodic system. 


Sesotho word order is (S)V(O). This paper will focus on the verbal complex 
which is composed of the following morphemes: 


(1) (S) SM-(T/A)-(OBJ)-V-(EXT)-M (O} 


In the following discussion I will assume an autosegmental analysis with 
separate segmental and tonal tiers. Though Sesotho can be analyzed as 
having only High (H) tones, with Low (L) tone as the default value 
(Kisseberth 1989), I will refer to both H and L tones for ease of reference. 


4. A Tonal vs. Intonational, Stress or Accentual System 
At 2;1 years the child most frequently has a HL (or H+) final pattern at the end 
of many utterances, but not necessarily at the end of medial clauses, as in (2). 


1 Age is represented in years and months: 2;1 years = 2 years and 1 month. 


2 Studies of the acquisition of phonology have shown that that is a certain amount of 
individual variation in phonological development. We might therefore expect some 
individual variation in the types of tonal acquisition patterns exhibited by different 
children. However, the present study of one child will hopefully identify some of the 
issues. Analysis of data from a second child at the same ages is currently in progress. 


3 Lesotho orthography is adapted here, resulting in a broad phonetic transcription, 
though mid vowels follow the non-distinctive orthographic conventions. The second person 
singular subject marker u, is rendered here as 0 (phonetically identical to third person 
singular subject marker, except that third person is High tone). The present tense subject 
-aarker (SM) assumes an -a when the verb is final in the verb phrase. Gloss abbreviations 
are as follows: AGReagreement marker, APL=applicative/benefactive, COPscopula, 
.)EMzdemonstrative pronoun, DIM=diminutive, EXTs=verbal extension, M=mood, OBJ=object 
clitic, POSS=possessive, PN=inde pendent pronoun, suffix, PREP=preposition, SMzsubject 
marker, T/A=tense/aspect, WH=question word, 8=noun class 8, 2s=uecond person singular. 
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(2) 21 yrs. ankd ké end 
(ke-a-tik-a+ ké e-na) 
1sSM-PRES-take-M COP 9-DEM 
'T'm taking (it), here it is’ 


However, he more accurately produces clause final HL by 2;6 years (3), and is 
even more consistent by 3 years, possibly indicating access to the syntax 
through prosodic cues. 


(3) 2,6 yrs. tél:a wa wakhutsanydne 
(o-a-téll-a 6-mo-khutswa-nydne) 
2sSM-PRES-disrespect-M 1AGR-1SM-short-DIM 
‘you're spate gesaed shorty’ 


While the child's productions at 2;1 years might be consistent with an 
intonational analysis, it appears that by 2;6 years he has learned the tonal rule 
of phrase final lowering. 


Sesotho has a phonological rule of penultimate lengthening. (=stress?) which 
is especially prominent at the end of clauses and which children fairly 
consistently produce by 2 years. If the child adopted a stress type of analysis, 
we would expect the lengthened syllables to be marked with a consistent tone 
pattern. We have noted above the frequency of a final HL pattern: We might 
hypothesize that this is evidence of a stress=tone analysis, however further 
evidence shows that this is not the case. Even where Sesotho would posit a 
HL final pattern, the child demonstrates inconsistency. Thus, while HL final 
patterns are very frequent in the child's speech, as well as in the input, the 
phenomena is not especially robust. 


Clements & Goldsmith (1984:16) hypothesize that children learning Bantu 
languages might adopt an accentual (in the sense used in Clements & 
Goldsmith 1984) analysis as a strategy for facilitating factorization, or mapping 
between segmental and tonal tiers in a more linear fashion. While it is not 
clear exactly what kind of data would address this issue, a closer look at the 
child's tone on verbs is suggestive. 


At 2;1 years, 73% of all verbs, most of them disyllabic, have a tonal pattern of 
HL in phrase final positicn, or HH in non-final position (i.e. when an object 
follows). The fact that so many of the verbs have a H tone on the first syllable 
might be consistent with an accentual analysis of the type proposed by 
Clements and Goldsmith (1984), where underlyingly a verb would be 
associated with a diacritic (*) which would then be interpreted as a High tone 
at a latter stage of the tonal derivation. But it is difficult to distinguish the 
effect of marking underlying representations with a diacritic rather than with 
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a H tone itself (see also Pulleyblank 1986). On either analysis the important 
factor is that the child is treating most verbs in the same way. One would 
therefore have to look for other evidence, such as when/at what level, lexical 
tones are assigned in the child's grammar. A closer look at the child's 
productions shows that there is developmental trend toward distinguishing 
two groups of verbs; this is as shown in Table 1. 


Table 1. 


Thus, while verbs are more consistently marked as H at 2 years (4)-(5), they 
have been differentiate into H and L by 3 years (6)-{7). 


(4) 2;1 yrs. (5) = 2;1 yrs. 
tea hana a-kula 
(ke-a-hdn-a+) (0-a-kul-a) 
1sSM-PRES-refuse-M 2sSM-PRES-sick-M' 
'I refuse’ you are sick’ 

(6) 3,0 yrs. (7) 3,0 yrs. 
o-ngold lengolo? ke-kopa motoho 
(o-ngél-d le-ngédlo) (ke-kop-a mo-tohd) 
2sSM-write-M 5-letter 1sSM-ask-M 3-porridge 
‘Are you writing a letter?’ ‘I'm asking for porridge’ 


I suggest the there is ample evidence from the input for initially construing 
disyllabic L verbs as H (due to High Tone Spread from subject markers). It 
could well be that the child's initial hypothesis is that all verbs are lexical H 
toned. The df lanl underlying lexical tone of verbs would then be 
acquired g.adually on a item by item basis within the lexicon, or until facility 
with other verbal melodies forced a reanalysis of lexical tones. Evidence for 
this proposal comes from the fact that certain high frequency verbs such as H 
toned hana ‘refuse’ and L toned batla ‘want’ are more consistently produced 
as such by 2;6 years than are other less frequent verbs (both in input as well as 
in the child's productions), anc that lexical tones on verbs are largely in place 
by 3 years when morphological tone phenomena are being acquired. 


We might then predict that the acquisition of tone on subject markers would 
either 1) parallel that found for verbs (Le. all subject markers will be initially 
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marked as H), or that 2) there will be an early and consistent distinction 
between H and L subject markers. Unlike with the more uniform marking of 
verbs as H, there is a more consistent (perhaps underlying?) tonal distinction 
between H and L subject markers around the age of 2, as shown in Table 2. 


Table 2. 


There is 80% accuracy in the marking of both H and L toned subject markers 
at 2;1 years of age, as in (8) and (9) where the majority (78%) of the L toned 
subject markers at 2;1 years are the 1st person singular subject marker ke 'T, as 
in (9). 


(8)  2;1 yrs. (9) 2;lyrs. 
éa Kae? a eohd hdpe 
(6-y-d kae?) (ke-ets-a hdpe) 
1SM-go-M where 1sSM-do-M agair. 
‘where is s/he going?’ I'm doing (it) again’ 


The marking of L toned subject markers as L is consistent across time (10). 
This differs from the marking of H toned subject markers, where there is an 
actual decline in the appropriate tone marking by 3 years, as shown in (11). 


(10) 3,0 yrs. rond re-ngola kdng? 
(rong re-ngolé kd-ng?) 
lpPN IpSM-write-M PREP-WH 
‘as for us, what are we going to write with? 


(11) 3,0 yrs. a-chécha 
(é-d-chech-a) 
9SM-PRES-reverse-M 
‘it's reversing’ 


I suggest that the decline in appropriate marking of H tone subject markers at 
3 years may indicate that the child is beginning to deal with OCP effects (see § 
6.). 

Based on this preliminary data I suggest that Sesotho-speaking children are 


well aware by 2 years of age that their language is a grammatical tone 
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language rather than a purely lexical tona', stress/intonational, or even 
accentual language. I now turn to a discussion of tonal rules. 


5. Iterative High Tone Spread 

There is some evidence that children may have an early rule of Iterative High 
Tone Spread that persists until the morphologic] tone patterns (Melodies II 
and III) begin to be acquired around 3 years. IJon-final disyllabic verbs are 
fairly consistently produced as HH (as in 6), and final trisyllabic verbs as HHL. 
While there are a few cases of High Tone Doubling (ie. spreading only to the 
next syllable), as in (12), the norm seems to be iterative spreading, as in (13). 


(12) 2;6 yrs. séfofdnu syd-bidika kwina 
(sefdfdne sé-a-bidtk-a kwéana) 
airplane 7SM-PRES-revolve-M DEM 
‘the airplane is turning about over there’ 


(13) 2,6 yrs. wend d mdthéld — md::::ne chabadfmachetse kwana 
(wend o-math-ei-a ma:né Chabadimaketse kwdna) 
2sPN 2sSM-run-APL-M DEM Ch. DEM 
‘you're running WA:Y over there at 

Chabadimaketse, far away’ 


There are few examples of four syllable verb stems in the spontaneous corpus. 
Further experimental study will have to determine the extent of the domain 
to which the child's rule of Iterative High Tone Spread applies. 


Spreading on H tone subject markers is less clear; about half spread iteratively 
as in (13) above and (14), while others do not spread at all (15). In some cases 
the subject marker and tense/aspect marker have been collapsed into one 
syllable where one H tone is retained (16). 


(14) 2,6 yrs. (15) 2;1 yrs. 
ad weld nth(é) énd... ad eta méda 
(é-@-w-el-a ntho é-na...) (a-éts-a ména) 
9SM-PRES-fall-APL-M 9thing 9-DEM 4SM-do-M DEM 


‘it's falling, this thing...’ ‘they're doing (it) here’ 
(16) 2,6 yrs. a nyola kho:fu yéna 

(€-d-nyoloh-a khofu é-na) 

9SM-PRES-ascend-M 9shovel 9-DEM 

‘it's ascending, this dumptruck’ 


Data from after 3 years, when children more consistently produce preverbal 


morphemes, will better be able to determine the extent of H tone spread on 
subject markers. 
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6. Obligatory Contour Principle (OCP) Effects 
The Obiigatory Contour Principle, ur OCP prohibits two consecutive H tones 
from occurring on the tonal tier. Solutions to this problem range from 
conflation of two H's into one H, or alternatively, as found in Sesotho, 
delinking of one H and filling in with 4 default L, resulting in a HLH 
sequ..ice on the tonal tier. Again, it is difficult from spontaneous 
roductions to determine what the child's underlying representations might 
a One might interpret some of the child's productions of Iterative High 
Tone Spread from subject markers as being application of the OCP, where two 
underlying H tones are conflated into one (17). However, it could also be that 
the child maintains two underlying H tones, thus violating the OCP; the 
natural prod:ction data render it difficult to distinguish the two analyses. 
There are some cases where a HLH pattern results, as in (18), but increasingly 
at 3 year. the subject marker is produced as Low (19). 


(17) 2,6 yrs. (18) 2,6 yrs. 

koldy ydkd é thétha mokilu: bd-kukd mollo 

(koldy ya-kd é-thoth-d bo-kiulube) (b4-kuk-4 mé-Ilo) 

9car POSS-my 9SM-carry-M 14-horse dung 2SM-take-M 3-fire 

‘my car is carrying horse dung’ ‘they're taking the flame’ 
(19) 3,0 yrs. a-chécha 


(é-d-chech-a) 
9SM-PRES-reverse-M 
‘it's reversing’ 


I suggest that examples such as (19) may indicate an emerging awareness of a 
rule of High Tone Delinking, where underlying HH on the tonal tier becomes 
LH, perhaps a response to the OCP. The status of the OCP as a either a 
language universal, and therefore part of Universal Grammar (McCarthy 
1986), or alternatively as a frequent, but language specific rule (Odden 1986, 
1988) is as yet unresolved. It is hoped that further acquisition research may 
shed some light on the debate. 


7. Conclusions 

While the findings presented here are still preliminary, there appears to be 
evidence from spontaneous, natural productions that, at 2 years, the child 
knows that he is learning a grammatical tonal, rather than an 
stress/intonational, lexical tonal, or accentual language: There is no evidence 
of fixed tonal patterns that would imply an accentual analysis, nor a robust 
correspondence between the penultimate 'stressed’ syllable and tone. While 
verbs are predominantly H, subject markers are distinguished by H/L 
contrasts. Secondly, an initia! rule of Iterative High Tone Spread on verbs at 2 
years gives rise to morphological tone rules, a sule of H Tone Deletion, and 
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the lexical distinction of verb tones by 3 years. Finally, it would appear that 
the effects of OCP are learned much as otner tonal rules, around the same 


time as the other verbal melody tone patterns begin to emerge. 


While this study raises many more questions than it answers, it is hoped that 
it will stimulate future research not only on the acquisition of prosody, but on 


the acquisition of phonolcgy as well. 
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ACQUISITION OF GENITIVE AGENTS IN SAMOAN 


Alessandro Duranti 
University of California, Los Angeles 


Elinor Ochs 
University of Southern California 


INTRODUCTION? 


Developmental psycholinguists have been centrally 
interested in children's understanding and linguistic 
articulation of what Slobin (1985) calls ‘manipulative 
activity scenes'- in which an agent performing some action 
affects some object. The concern of the present study is to 
extend our understanding of manipulative activity scenes and 
grammar beyond the articulation of major sentential 
constituents, more specifically to attend to ways in which 
children and adults grammaticalize manipuletive activity 
scenes and perspectives within genitive constructions. 
Genitives have been primarily associated with the encoding of 
locative relationships such as possessor or goal (cf. Clark 
1978; Lyons 1968). In Samoan, however, the genitive 
construction encodes a wide range of semantic roles including 
human agents (cf. Duranti & Ochs in press). That genitives, 
often called "possessives," do not Simply or exclusively 
express re).ations of ownership has been noted by a number of 
scholars. Further, the link between genitives and agency has 
been reported in the acquisition literature (cf. Budwig 1985) 
and in typological studies of ergative languages, which note 
that in several languages, the genitive and ergative marker 
are the same. In Samoan the genitive marker and the ergative 
marker are not the same. Nonetheless there is a strong 
semantic link between the two. Our Samoan data represent to 
our knowledge both the most varied and the most recurrent use 
of genitive constructions for semantic roles other than 
possession. In this paper, we describe how Samoan adults use 
genitive constructions and compare adult strategies with those 
of four young children. 


RESEARCH BACKGROUND 

Samoan adults and children differ little in their 
expression of major sentential constituents and in the use of 
ergative case marking. Both prefer verb-initial utterances 
that contain only two major constituents: a verb or verb 
complex (VC) and a nominal argument. The NP expressed tends to 
be an absolutive NP, either Subjects of intransitive verbs or 
Objects of transitive verbs. ~ 3 basic structure of utterances 


1s thus: 


(1) ve + Absolutive NP 
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Du Bois (1987) suggests that in all languages speaker-hearers 
tend to avoid expressing Agents as full lexical NPs. Speaker- 
hearers typically identify agents from referents expressed in 
prior discourse as absolutive consituents. Our examination of 
Samoan speech and writing, however, suggest that this 
presumption requires further thought. In Samoan, Agent 
participants may be expressed through genitive constructions 
within the absolutive NP. If we take a strictly syntactico- 
semantic definition, viz. Agents to be Subjects of transitive 
clauses, then our data largely confirm Du Bois' findings. On 
the other hand, if we widen our notion of Agent to include 
potential or factual agents in described, evoked, or 
presupposed events, regardless of the grammatical role of the 
phrase in which they are linguistically expressed, our data 
show different results. 

In Samoan, the Absolutive NP of a two constituent 
utterance is often a complex NP that includes both an Affected 
Object (or Undergoer) as a Head Noun and an Agent or some 
other semantic role(s) in the Modifier. The syntax of these 
constructions is schematically represented in (2) (the angled 
brackets indicate an “either or" condition in the case of 
coreferentiality of Pro and NP): 


(2) Verb Complex + { Art <Gen Pro> Head Noun <Gen NP> ] 
NP 
While genitive constructions in Samoan often express a 
relation of "possession," they express a wide range of other 
participant roles as well. Thus, in (3), the genitive phrase a 
Eki 'Eki's' refers to the person who prepared the focd. Given 
that Eki is a young untitled male, it would be inappropriate, 
in a Samoan cultural context, to define the food he cooked for 
others as "belonging" to him. We consider this an example of 


genitive construction used to express an Agent: 
(3) ("Pastor & Deacon" )* 


24 fai le umu kalo a _ Eki ma luau 
deo ART oven taro of Eki and palusami 
(lit. make Eki's oven taro and palusami ) 
‘Eki made baked taro and palusami' 


Table 1 shows the distribution of different semantic 
roles in genitive phrases in adult speech. After Possessor and 
Body Part, Agent is one of the most common types of semantic 
roles expressed through genitive phrases. This finding opens 
up a whole series of questions about the definition and 
distribution of not only Agents but Actors, Experiencers and 
other semantic roles in a language like Samoan. Rather than 
the putatively "natural" or "universal" tendency for human 
participants to appear as Subjects, 4 tendency codified as 
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"Subjectivization" in Case Grammar (cf. Fillmore 1968; 1977; 
cf. also Kuno 1974) and “genitive ascention" in Relational 
Grammar (cf. Kimenyi 1980), Samoan seems to favor 
"Genitivization." 


Table 1 * 
Distribution of Semantic Roles in Genitives (Adult) 

Speakers: Semantic Roles** Encoded: ~ 
Ce ee ee ee 

POSS SEN GL/LC AG ACT EXP PART PNT REL/KIN 
Women 

19 114 -06 ~-16 .16 06 .04 .O1 16 

(27) (20) (9) (22) (23) (8) (6) (2) (22) 
Men 21 .12 -16 19 .08 .06 ~ 0 = -23 

(17) (10) (13) (16) (7) (5) (8) (19) 
TOTAL: -20..138.. -10 17.13 1.06 .06 .01 .23 


(44) (30) (22) (38) (30) (13) (14) (2) (51) 


*Each genitive construction may encode more than one semantic 
role. **POSS=possessor, BEN=benefactive, GL/LC=Goal/locative, 
AGsagent, ACT=actor, EXP=experiencer, PART=body part or other 
part/whole relation, PNT=patient, REL/KIN=social relationship, 
including kinship. 


There are, however, semantico-pragmatic differences between 
the use of genitive vs. ergative NP’s (Duranti & Ochs in 
press). In contrast to languages like English, whsre Subjects 
of transitive verbs can express a wide rangé of semantic roles 
(Keenan 1584), in Samoan, ergative NP's cover a restricted set 
of roles, typically human initiators of actions (cf. Cook 
1988). Furthermore, ergative NP's may index or assign 
accountability to the participant role (cf. Duranti 1988). 
When the genitive phrase, as opposed to the ergative phrase, 
is used to refer to the putative agent, the focus is on the 
product or result of the action (if the verb is a potentially 
transitive verb) rather than on the party responsible. For 
this reason, genitive phrases seem to cover cases that in 
other languages might be expressed by passives or stative-like 
clauses, where the Patient or underlying Object acquires the 
syntactic role of Subject. 


THE ACQUISITION OF GENITIVE CONSTRUCTIONS 

Is complexity of the Absolutive NP something that unifies 
both adult and child language? Or is it here that adults and 
children's speech differs? In contrast to acquisition of 
Clause structure, the acquisition of genitive construction 
shows a clear progression towards a broader range of semantic 
roles encoded and more complex head nouns. 
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Table 2 - 
Semantic Roles in Genitive Constructions (Children) 
Child/ Semantic Roles Encoded: 
Age: 
: POSS BEN GL/LC AG ACT EXP PART PNT REL/KIN 
Kalavintl 
(137) = - = 7 ve = - - * 
(1:9) = - = . . 7 = - = 
(17311) 1.0 - - ~ - ~ - - - 
(1) 
(23:1) 26 .68 Pe ee - - ~ - .03 - 
(10) (26) (4) (1) 
TOTAL: 29 67 10 03 
(11) (26) (4) (1) 
Iakopo 
(2.1) - 67 - - - - - 33 - 
(2) (1) 
(278) 45 48 - .03 Pas | -08 .16 - - 
(17) (20) (1) (4) (3) (6) 
TOTAL -38 43 .02 .O9 -0O7 13 02 
(17) (22) (1) (4) (3) (6) (1) 
Pesio 
(23:3) Pe) 35 -10 -O4 - ~ 04 14 07 
(11) (10) (3) (1) (1) (4) (2) 
(2;10) 273 14 07 ~0S OZ 02 .06 03 01 
(129) (24) (13) (8) (3) (4) (36; (5) (2) 
TOTAL 68 16 08 04 «UZ 02 .05 04 02 
(140) (34) (16) (9) (3) (4) (11) (S}) (4) 
Niulala 
(2311) 26 54 .19 -O3 .06 - -13 Os .O1 
(20) (42) (15) (2) (5) (10) (2) (1) 
(376) 32 o23 .02 09 .O02 £05. «3Z - - 
(14) (10) (1) (4) (1) (2) (14) 
28 43 is 05 .05 02 .20 O02 O1 
(34) (52) (16) (6) (6) (2) (24) (2) (1) 
TOTAL: -49 33 -O9 04 03 02 .10 03 .O1 


(202) (134) (36) (16) (13) (9) (421) (13) (5) 
*Each genitive construction may encode more than one semantic 
role 
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Table 2 indicates the acquisition patterns of four 
children: Kalavini, Iakopo, Pesio, and Niulala. At an early 
point in acquisition, children use genitives primarily to 
express possessor and benefactor roles. The youngest child, 
Kalavini, does not encode genitive agents at all. The next 
youngest child, Iakopo also does not encode genitive agents in 
the earliest recording session and produces only 1 seven 
months later. Genitive Agents account for somewhat more of 
Pesio's and Niulala's genitive constructions, with the last 
session of Niulala at 3 years 6 months showing the greatest 
proportion at 9%. These data suggest a developmental pattern 
towards increased use of genitive NPs to encode Agent roles. 
In the corpus at hand genitive Agents characterize 4% of 
children's gsnitive constructions in comparison with 17% of 
adult genitive constructions. 

Children's use of genitive constructions to express Agent 
roles is illustrated in (4) through (6) below: 


(4) (Pesio, 2;10) 


kusi::: -si:: lou aka?/ 

write -te your picture 
(lit. ‘draw -aw your picture?’ ) 
‘are you drawing the picture?’ 


(5) (Niulala, 2;11) 
masae le fofu]lvae [0] Fineaso 
ripped the pants [of] Fineaso 
‘Fineaso ripped his pants’ 
(6) (Pesio, 2310) 
sa fai makou mea'ai 


TA make our(excl) food 
'(We) made food for ourselves.' 


Genitive constructions of children and adults also differ 
in complexity of the head noun. In adult constructions where 
the modifier is an Agent, Actor, or Experiencer, the head noun 
is often a nominalization. In children's constructions, 
nominalizations é@.e both rare and relatively late to be 
productively acquired. These patterns are expressed in Tables 
3a and 3b. 
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Table 3a 
Nominalized Head Nouns in Genitive Constructions (Adults) 
re ee ee Swe ee ee 
Women: Men: TOTAL: 
13 .07 git 
(18) (6) (24) 
NN 
Table 3b 


Nominalized Head Nouns in Genitive Constructions (Children) 


i 


Kalavini Iakopo Pesio Niulala 
(137) = (2:1) - (273) - (2:11) .O1 
(1) 

(1;9) . (2:8) - (2:10) .91 (3:6) .10 
. (2) (4) 

(1:11) - 

(2;1) - 

a ep ee 

Total: - ~ -O1 .06 


(2) (7) 


I 


Tables 3a and 3b indicate that 11% of adult genitive 
constructions contain nominalizations, whereas only 2% of 
children's genitive constructions contain nominalizations. 
Table 3b indicates further that nominalized head nouns are 
absent or rare before children reach 3 and half years. To 
some extent this developmental pattern is linked to the late 
emergence of agents, actors and experiencers as genitive 
modifiers in children's speech. 


CONCLUSIONS 

The Samoan data presented here suggest that while Samoan 
adults and children both favor a clausal strategy of 
highlighting the affected object in a manipulative activity 
scene, Samoan children have difficulty exploiting the grammar 
of genitive noun phrases to encode agent roles as well. This 
pattern implies that children's two-constituent utterances 
differ from those produced by adults. In children's 
utterances, when an Agent is not encoded as a major sentential 
constituent, it is likely not to be encoded as a genitive 
modifier. That is, Agents are likely not to be found anywhere 
within the two-constituent clause. In interpreting children's 
speech, then, hearers must resort to one of the pragmatic 
strategies suggested by Du Bois, namely, locating Agent 
participants in the immediate setting or in previously 


102 


a ‘ sore ae ay SE Ge A eee a 


pan nt ae ERE ee eee ek 


ad 


XY 


Hl 
i 


95 


mentioned absolutive NPs. In contrast, interpreters of aduit 
speech may locate the agent participant inside the absolutive 
NP itself. 

For all acquirers, the morpho-syntax of noun phrases is 
an important dimension of linguistic competence . In Samoan, 
however, and perhaps in other languages with a two-constituent 
bias, genitive constructions, nominaliizations and other types 
of complex noun phrases lace even the most informal of 
conversations. In all kinds of Samoan talk, the absolutive NP 
in a two-constituent utterance is often heavy, loaded with 
information concerning human participants and the actions, 
states and locations that bind them. Speakers reqularly 
produce such verb-initial utterances as ‘Look at the 
stretching of that one’ (Va'ai le fa'ake'e'ku'u a lele (PI- 
3:24)), ‘Exceptional is the anger of the girl' (Ese fa‘ali'i o 
lea kegikiki (PI-3:46)}, 'Look at the actions of Sio’ (Va'ai 
le fai'iga o Sio (PI-9:50)), 'Do you know about our going to 
New Zealand?' (E ke iloa ‘oe le maa ooga i Giusila? (uaki: 
377)). That such constructions are used so often and witli such 
a variety of meanings suggests that the internal structure of 
the noun phrase is a particularly central domain of 
grammatical and conversational competence for Samoan children 
to acquire. 


NOTES 


1) This paper is based on research sponsored by the National 
Science Foundations (Grant No. BNS-8608210, A. Duranti & E. 
Ochs principal investigators). 


2) Abbreviations: AFF= affect particle; ART= article; Dxk« 
Geictic particle; EMP= emphasis particie; INTs intensifier, 
sometimes with reflexive function; Prep= preposition; pro= 
clitic pronoun; TA= tense/sspect marker; PST=past. 
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ACQUISITION OF NULL SUBJECTS AND CONTROL 
IN SOME SINHALA ADVERBIAL CLAUSES” 


James Gair, Barbava Lust, Leiwala Sumangala, and Milan Rodrigo 
Cornell University 


General Introduction: This paper reports selected results from a large project concerned with the 
ucquisition of empty categories(EC's) in Sinhala, an SOV language of the Indo-Aryan family, spoken 
in Sri Lanka. It concerns itself specifically with EC’s which occur in a subset of adverbial clause 
types differing with regard to the kinds of null subjects they permit, including those which are 
obligatorily coindexed, Le., ‘control structures . 

There have recently been 8 number of studies of the first language acquisition of ‘contro!’ 
structures in English (e.g, Cohen Sherman 1983, Cohen Sherman and Lust, 1987, 1988, Hsu, Cairns, 
Fiengo, 1985). To date, however, there has been little or no study of the acquisition of such 
structures in other languages, such as Sinhala. (See, however, Lust, Waksyama, Snyder, Mazuka and 
Oshima for a related study of Japanese acquisition; arai Lust, Gair, Goss and Rodrigo 1987 for an 
earlier study of the acquisition of Sinhala EC's). 

Typological Background: Sinhala is verb final and strongly, in fact virtually exclusively, left 
branching and right headed. Both complements and modifiers appear to the left of their heads, as 
shown in (1): 


1 NP:  lagkaawe tee Sri Lanka~GEN tea -‘Sri Lankan tea / ses of Sri Lanka’ 
gunopaala geno kataawa Gunapala about story-DEF="The story about Gunapala’ 
VP: tee biwwa tea drank- ‘drank tea’ 
ikmansTa diwwa quickly run-PAST-'ran quickly’ 
AP: hufigak raso much tasty='very tasty’ 


PP: lapkaawe e@tule Sri Lanka within-'in(side) Sci Lanka’ 


Recursive sentence embedding shares this left branching character. As exemplified in 2a and 
2b, relative clauses appear always to the left of their heads, and finite adverbial subordinate clauses 
appear to the left of the main clause in unmarked order. 


2. a.{[mama gunapaalets dunno] pots) [ilapee iskoole ugannena) 
I Gunapala-DAT gave-REL book our  school-loc teach-REL 
guruwereyek liwwel ekek! 
teacher-INDEF wrote-REL one 
‘The book that I gave Gunapala was one written by a teacher who 


teaches in our school 
b. [liye gedara ifidalajawmetes yanskots}] mame loku nayek da@kkal 
yesterday house from town-DAT go-when I big cobra see-PAST 


‘| saw a big cobra when I was going from home to town yesterday’ 


Variant orders are, however, possible, generally with discousse-pragmiatic effeci. Thus, in a 
simple sentence , in addition to the unmarked SOV order, all possible orders of major constituents are 
acceptable with the proper intonation. This freedom extends also to subordinate clauses, including 
the -aame and -la clauses with which we will be chiefly concerned. 
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Three further characteristics of Sinhala are extremely important here. First, finite verbs even 
if tensed fail to show agreement of any kind in the spoken varieties of the language!, as shown in 
(3). 

3. mama/ oyaa/ eyaa/ api/ oyaala/ eyeala pansalote yanowa/giyaa 
1 / you/ (s)he/ we/ you-PL/ they temple-DAT go-PRES/ go-PAST 
‘l/ you/ (s)he/ we/you-all/ they go/went to the temple 


Secondly, Sinhala is a language of the kind that might be called “super pro-drop’, ie, 
allowing EC in all argument positions governed or ungoverned (except for object of postpesition) (cf. 
Sumangala 1988). (4a) illustrates this for simple sentences; (4) for complex. Thus, EC ‘s in Sinhala 
are widely determined by pragmatic context or discourse context for ambiguity resolution. 

Thirdly, although lexical pronouns (e.g., eyaa “(s)he’, etc.) anc lexical anaphors/reflexives 
(eg, taman/tamun'self") do exist in Sinhala, their appearance is far less common than that of null 
pronouns and is related to intentions of contrast, emphasis, or ambiguity resoludon. (cf. Gair, 1970). 
Comparison with English translations in (4) will Mlustrate this. 


4. a. TicketAgent: menna noonage fiket-eks Customer. © @ 6 denna 
here lady~GEN ticket-DEF 806 98 gve-INF 
‘Here is your ticket (madam).° ‘Give (me) (the ticket)’ 


b. @ maTa enakota @ okkoma kaala iweray 
@ I-DAT come-when © all eaten finished 
‘By the time @ came to me, all(food) was eaten and finished’ 
or ‘By the time it/they came to me, all(everybody) had finished eating’ 


As current linguistic theory would predict, however, while Sinhala empty categories are prolific and 
often pragmatically (non-grammatically) determined, their distribution in Sinhala is grammatically 
constrained, and their interpretation is grammatically differentiated. For example, an empty pronoun 
may not c-command its antecedent, (5), and co-reference with a c-commanding antecedent in a 
local domain is not possible with a coreferential interpretation(6) . 


5. G7; gunepaalage; ammata kematly @ gunapala-GEN mother-DAT likes 
‘Hey /; likes Gunapala’s; mother.’ 


6. gunapaala; Oy /; kannaadiye dakka Gunapala @ mirror-LOC saw 
‘Gunapala; saw him / *himseif; in the mirror.’ 


One effect of the combination of lack of agreement, relatively free word order, and the 
wide distribution of empty categories is the severe attenuation of surface signals accompanying 
different empty categories. in Sinhala This might be expected to heighten the problem of first 
language acquisition , to the degree that this acquisition depends on such signals,. As we shall see, 
this problem is even further compounded in the case of one set of adverbial clause structures. 


The-la Conjunctive Participle: Sinhala shares with many other languages of South Asis the 
existence of a conjunctive participle? commonly cited as an important areal feature (Emeneau 1956 
and much subsequent work...see particularly Masica 1976). The form fulfilling this function in Sinhala 
is the -la participle, as illustrated in (7) The sense is commonly, though not necessarily, temporal, 
with the action of the embedded clause preceding that of the main one and commonly linked to it in 


some fashion. 
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7. mame gedero gihile keems kerwa 

Lhome go-ig food eat-PAST 

T went home and ate.’ 
The embedded Ig clause in 7 is in its unmarked position preceding the remainder of ¥ (VP), and is 
clearly within the minimal dominating S (), This ts shown by a number of characteristics, such as 
inclusion in the scope of negation of the main verb, which we will not pursue here, but it is perhaps 
ihistrated most Gramatically by the possibility of sentences like (83), in which the subject is in the 
nominative case, as required by the main verb diwwa ‘ran’ although the embedded -la participle 


zhila (@henewa ‘hear’) would require the dative, as shown it. .8b)4 


8. a. mamo saddoyak xhila diwwa I-NOM sound-INDEF hear-la run-PAST 
‘l heard a sound and rar’ 
b. mata saddsyak ehuna I-DAT sound-INDEF hear-PAST 
‘l heard a sound.’ 


A crucial characteristic of -]g clauses in the conjunctive function is obligatory 
coreferentiality between main and -la clause subjecis as in (9a); Le, they are control structures. 
Their contro! properties include the fact that they do not allow an overt NP, as in (9b). 


9. a. ‘sma lypls Bis gedare gihila] keame kzzwa ypls) 
home go-lg food eat-PAST 
T went home and ate.’ 
b.*[gmama {vp {s Kalyani gedero gihillas} kerma kzewa ypls) 
I Kalyani home go-la_ food eat-PA 
‘Kalyani went home and | ate’ 


We thus hypothesize that the basic structure for (7 and 9a) is as in (10)5 Here the EC subject in the 
la clause occurs within a nonfinite clause; it is c-commanded in a basic “control’ configuration (cf. 
Huang, 1989). 


10. 


Vv INFL 
CONJUNCTIVE -la 


-aamo Adverbials: The EC in the conjunctive -la structure can be contrasted with that which 
occurs in a finite tensed adjunct subordinate clsuse, such as -the aamg ‘when/after’ clauses 
ilustrated in (i). The -aama_ form is made by adding that affix to the past tense adjectival form of 
the verb, the chief use of which is to form relative clauses as exemplified in (2) earlier. The 
adjectival form with -aama is always past tense. Hence from dunno ‘gave (relativizing) is formed 
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dunngame ©, The sense is prior temporal "when", with differences from the similar use of -la that 
need not concern us here. 

The -game construction easily involves the lexical expression of two separate 
subjects, as shown by (ila). Coreference is not required of 3 null subject, as in (lb). (A lexical 
pronoun may replace an EC in an -gama construction, as in (fic) although a lexical pronoun in this 
case, as in many cases in Sinhela, tends to favor a non-coreferential reading.) 


ila. mahatteya aawaame mame weeds kerannay 
gentleman come-aama { work do-OPT 
’ Twill work when \after) the gentleman comes.’ 


b. gunapaala, gamata giyaama jj gafigee newewa. 
Gunapala village-DAT go-aama © civer-LOC bathe-PAST 
‘When (after) Gunapala, went to the village (hej ; bathed in the river) 


c. gunepaala; gamote giyaame eyaa ;; gafigee neewa. 
Gunapala _village-DAT go(when) he/she river-LOC bathe-PAST 
When (after) Gunapala; went to the village hej ; bathed in the river) 


Given these characteristics of -aama clauses, especially their non-control properties, we assume that 
they are adjuncts outside the minimal S, and that their subjects are not c-commanded in a control 
domain by the main clause subject, as in (12). 


12. -aama CLAUSES 


- 
1 
( “N 
DIN vN 
7 YV “wr comp ok OF) 
pro /N ems -gaina 


We may now ask whether the child scquiring Sinhala knows the subtle differences 
between the EC's in -la and -aama constructions and if so, what is the nature of the development of 
this knowledge? We have already noted the paucity of surface cues such as agreement, fixed 
constituent order and narrowly restricted distribution of EC's. One obvious possibility would invoke 
the subordinate morphology itself; i.e. the co-occurrence of different EC’s with those affixes. 
However, this possibility is confounded by the lack of one-to sne co-occurrence between -jg and 
subject EC type. This is a function of the fact that Sinhala -la has other, non-conjunctive uses, which 


we will briefly describe. 


Absolutive and Finite -la: A structure with -la may occur productively as a finite sentence, as 
in (13), and such sentences are by no means uncommon (Note that 13c. shows both conjunctive and 
finite -la.). 
13.2. mahattaya kantooruwete gihilla. gentleman-NOM office-DAT go-ja= 
"He(hon.) has gone to the office.’ 


b. gunopaala mee wede okkome iwerekarala Gunapa!s-NOM this work all finish-la- 
‘Gunapala has finished all this work. 
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c. kalagediya meesep wetila kedila water jug-NOM table-ABL fall-Ja break-ja- 
‘The water jug has fallen off the teble and broken.’ 


Furthermore, -Ja also occurs in an ‘absotutive’ construction. This is a subordinate 
structure, but it allows a lexical subject and shows none of the syntactic reflexes of a control 
structure. Non-coreference between subjects is not only allowed, but common, as in (14). 


14a. amma gamote gihilla, mama seerama gedare weeda karanns cone 
mother village-DAT go-lg L-NOM all house work do ny.cessary 
‘With Mother gone to the village, I have to do all the housework’ 


b. loku mahattayata asaniips welsa, mehee kaurut weeda keranne nex 
big boss-DAT _ sick get-made-la_ here everybody work do NEG 
‘With the big bass sick, no one here is working.’ 


On theory-internal grounds, we assume that the -la clause in (14) reflects an underiying adjunct 
structure like the -agma clause in (12), and not like the control -jg structure in (10). Thus in (4a 
and b) , the -Ja clauses ure outside the minimal S containing th< main verb, and their s_bjects are not 
c-commanded by the inain verb subject. 

The relevant properties of the three-lg constructions are compared with each other 
and with -aame in the table in (15) 


15. “JBJECT PROPERTIES OF -la AND -aama 


a 
eS ol 
(Conjunctive) }(Absolutive)| (Finite) 
| Obtgarory Subject Coreference| + | = | - | | 
[vo Possbie Over subect | + | = | | 


Fcconmanded yMansaea] > | - |= | | 


As (15) shows, -aama patterns with the absolutive and finite -la_ in relation to the type of 
subjects it permits, though it is never used as the main verb in independent sentences. Thus, the 
problem of the non-congruence of morphology with nominal type is rendered even more complex. 
The learner must dea! with four structures, of v‘uich three invo%ve the same verbal morphology, and 
the control structure involving -la is not morpt slogically distinct from the other two Ia structures, 
which pattern with -gama in terms of the king of subjects allowed (and for the absolutive one, in 
contro! properties). 

The Acquisition Problem : Given the lack of straightforward surface cues in Sinhala, we can 
hypothesize that only sbstract syntactic structure, presumably configurational differences, can 
account for this differentiation of EC’s in adverbial structures. Critically, Sinhala does not allow the 
child to depend solely on surface cues of morphology, as the foregoing has made clear. For the child 
acquiring Sinhala, then, only 2 sensitivity to such structural differences could sccount for their 
differentiation of EC’s in this set of adverbial types. Critically, if the child acquiring Sinhalese is 
found to differentiate not only the EC's in the contro! -[g and the -aame constructions, but elso the 
different types of -la, where no morphology is available, then we have a powerful argument for such 
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sensitivity, In the remainder of this paper we will adduce both experimental ard natural speech 
evidence for this ‘structure-dependzar*’ in early acquisition. A strong theory of Universal Grammer 
would predict that the child was equipped with such principles of ‘structure dependence’ “™omsky 
1986), and recent work has argued thet children evidence such principl:s continuoudy © sly 
stages of first language acquisition in other domains of anaphora (Lust, 1986, 1967). 

The Acquisition Evidence: (s) Natural Speech. We have begun to study the ne . speech 
of 74 children acquiring Shuhala as a first language, from 2 years, zero months to ‘our years, zero 
months, with a mean age of 3,01, divided into four 6-month age groups. Each speech sample had a 
mean of 133 utterances and was collected by a native speaker in village homes or schools in the 
Kadawatha area of Sri Lanka. These caaly:.s revealed a total of 438 utterances with various -la 
constructions, rising from a mean of 3.§ ner child in the youngest age group to 10.5 in the oldest age 
group. In contrast, only 47 utterances with -aame were found, remaining at a low mean rate of use 
over development, ie., .07 in the youngest group, .84 in the oldest. Most interestingly, however, 
from the earliest age group, all three types of -'g construction w:.e evidenced, as table 16 
exemplifies. The control type ‘conjunctive’ -'a is productive at all age levels. 


EXAMPLES FROM CHILDRENS NATURAL SPEECH: DIFFERENT USES OF -la 


G#; AGE Unt) 
Group | 0) coniunctive: mee, gaumak efidalz innowe $05; 200,49} 
jook, frock wearis be(animsio)-PRES « 
Look, she is wearing a dress 
(2) ghyohutive: nangi-t kearckee nrgola yanows nang-t xia 0005:2.00-67) 


little sister car-LOC enteria go-PRES little sister-alo with - 
Little sister is going in Use car with the litte sister’ 

(3) Sings oon, amma genslls, occ ? (009-2.04500 
there, mother bring-is, TAG 
There, mother has drought &, hasn't she?’ 

Group 2:(4) coniunctive:: budiyanakota maduruwo swila kanawe (2 iS) 
sleep-when mosquitoes came-ta bite-PRES- 

"When I sleep mosquitocs come and tite me’ 

(5) gbsohitive: bakti-gika gihills sswa afivo-berce (209;2.07102) 
‘phakti'-song-PL go-ia come-PAST (afigo)-diums 
‘Bhakti songs passed and the (aids) drums appeared’ 

(6) finite belt eka kendila ( 2.205) 
beit-one break-is 
‘The belt is broken’ 

Group 3:(7) conjunctive: api mee pask ek2afa gihils meek elowanowa nee? 
we this perk-DEF-DAT go-is this one drive-PRS (¥723.0573 
“We wil go to that park and drive it, woot we’ 

(8) abdsolytive: ekkan avila, amma innows gedore (906;3.066.) 
Ddring-la mother be(animste)-PRES home 
Having been beought. mother is at home’ 

(9) finite: arsket paraiiis ( 308:3.0£9) 
that one-elso overturn-ia 
‘That one, too, hes dasn overtumed’ 

Group 4: (0) conjunctive ekki arwille ask yanows (408;3.08,88) 

hig-sister come-is again go-PRES 
‘Big sister came and is going beck’ 

(2) gbsotutivs pettyake daais thbe mocks (43, 4.00.25) 
box-INDER-LOC put-ia be-PAST-ROC this-one 
‘Sameoneyhaving put (0 in s box, thist-it) remained 

€3) finte: ewunge amma avis (405;3.08,99) 
they-GEN mother come-is 
‘Their mother has come’ 
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The Acquisition Evidence: (b)Experimental Data. Although the observed productivny in 
natural speech does suggest both an early knowledge of the Sinhala control structure, and @ 
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differentiation of closely related adverbial structures, the natural speech data alone do not 
unambiguously identify the factors which children are consulting in differentiating these structures 
and their EC.'s To more precisely test the nsture of children's knowiedge, we have conducted 2 
series of experiments to evaluate the specific factors which may be involved in the child's 
representation of these structures. 

The Comprehension Test. in one of these studies, we tested 169 Sri Lankan children from 2 to 6 
years of age (mean 4,8) acquiring Sinhalese, on a set of 16 experimental sentences in a standardized 
‘act out’ test of comprehension. Haif of these involved a -[g construction’ ; half involved an -aama 
construction, as exemplified in the table in (1 7). All involved an EC in subject position of one clause. 
Each of these sentences was varied in design according to two syntactic factors, as the table 
suggests. Branching Direction (whether the adverbial clause was preposed (LB) or postposed(RB)); 
and Proform Direction (whether EC followed(forward) or preceded(backward) the subject name in 
the sentence). Finally, 70 of these children received the experimental sentences with an initial 
‘pragmatic lead'(PL) to the name in the sentence. For example, sentence ! in ({7 ) was preceded by 
a sentence, similar to “Now I am going to tell you a little story about the monkey’. 
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-la EXPERIMENTAL SENTENCES: ACTOUT TASK 
Porward Left y 
L watidura keseigediya sbulols 63a wanw 
monkey banana (having) pkked-up 6 hand waves 
When the monkey picked up the banana, © waves the(his) 
hand. 
2 kopya bas-eka perabla 6 rawunmek duwonws 
tiger Dus-the fhaving) knocked~down © circle runs 
When tie tiger inocked down the bus, 6 nuns in a circle. Whes © dropped the benana, the fing somersaults. 


Forward Right Backward Right 


13. wafdsure natanswa © keseigediya wiiskorols 
morkey dances 6 Danang having) thrown 
The monkey dances when © threw the Danang. 


4 gemba pinumak gehanowa © {opiyo pahurugaais 
frog somersaults © wiee (having) pawed 
The frog somersaults when 6 pawed the toffee. 


9. waifidurs boole ahulspuwsame 6 appudigahanova 
monkey bal pick-upiwhen} © ind -<claps 
When the monkey picked up the ball 6 claps the {his} 
hands. 


$0. gemba bas-cka adopuwaame 6 muun> atullnews 
frog busthe pulfwhen) 6 face rubs 
When the frog pulled the bus, 6 rubs the(his) face. 


Forward Right 


i. wafidure nondigahanovs © bas-cko alopuwaans 
monkey imps Gburthe wuch(whea) 
The monkey imps when © touched the bus’ 


2 balla oluwe hollanowa © geb paagopuwzsue 
dog heed shakes 6 stone step-ca(when) 


The dog shakes the(his) head when © steps on the stone. 


7, O rawumak duwsoswe puuse bas-eko stomrois 
6 csvle no cat — bus-the (having) dropped 
© runs in s circle when the cat dropped the bus 


3 © bereio shulsmuwasm> puusa ud penises 
6 busthe pick-up(when} ca up fumoe 
When 6 picked up the bus, the cat limps up, 


4 6 boole rcokkarspuwaam> kotlya pinungshanows 
6 del rok(when) tiger somersaults 
When © rofied the ball the tiger somersaults. 
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This design allowed us to test the following hypotheses regarding children’s knowledge 
of -ia and -agme strucures: 

i If children do distinguish -la structures as ‘contro!’ structures in distinction from tne 
-agame adjunct structures, then they should assign significantly more coreference in their 
interpretation of the -Ja structures , which involve obligatory coreference, than in their 
interpretation of the -aame structures. They should assign significantly more disjoint reference 
responses to the -game structures, which involve a free EC, than to the -Jg structures. They should 
also allow the pragmatic lead to influence their interpretation of the EC in the case of the ‘free EC’ in 
the -aama clause significantly more than in the controlled EC in the -[s clause. 

ii. In particular, if children distinguish -la and -agama sentence types in terms of their 
abstract representation, as in 10 and 12, then children should differentiate the -la sentences 
according to the factors we varied experimentally. 

Results: Analyses of results by standardized scoring criteria included the following: 

i We consider first the main effects, camputed by ANOVA, on amount of children's 
judgements of coreference(CR)) between the EC in the sentence and the name. Children made 
significantly more coreference judgements when responding to a sentence with -[g than to a sentence 
with -aama when the -la and -aamg sentences were analyzed as a total set (F(i,95)=7.96, p».006). 
Overall, children computed a mean number of 1.20 coreference judgements (CRJ) with -la; 106 with - 
game type sentences. (Score range -0-2). A higher amount of CR] on Ja was consistent over 
development. They made significantly more disjoint reference judgements (DRJ) when responding to 
a sentence with -aama than with -la (F 1,95-10.47, p-.002) ( -aame type .08; -la .02.) 

The factor of Pragmatic lead (PL) significantly affected CRJ on both -la and -same overall. 
However, analyses of interactions among the factors we varied showed thst in the case of -la, the 
effects of PL were more limited . 

ii. Children did differentiate the -la structures according to the experimental factars 
manipulated. For example, the la sentences! and 2 had the highest amount of CRJ(151, Le., about 
75%, of the data,), of all conditions for either ‘la’ or ‘aama’. The corresponding sentences with 
‘agama’ on table .17, Le, 9 and 10, had less CRJ (119)..There was no significant effect of PL on the 
paradigm control -la_sentences | and 2 (1.56 vs L51 with and without PL respectively.); although there 
was a significant effect on -la structures overall (F(1,161)*8.90,p-.003), as there was for -sama 
overall (F(1,161}=12.98,p-.0004).). 

The high CRJ values for the ("Forward Left” } sentences { and 2 {on table 17) actually 
corroborate our basic hypothesis here in a serendipitous way that was not entirely anticipated under 
the original design. These sentences were designed under the assumption that they could and would 
reflect EC’s 2s shown on the table; that is, that the overt NP subject would form part of a-la clause, 
outside of the main clause, analogous to the -gama sentences. That is, they were expected to be 
interpreted as in 18a and b, with a tree structure paralle! to the -game clause in 12. 

18 a. [waiiduraj keselgedi ahulala] 0j4 ate wananawa 
{monkey; banana pickup-la] i, hand waves 


b. _ {kotiya; bas-eka persiala] @j4 rawumak duwenawa 
[tiger bus knockdown-la} 4 circle runs 


However, these sentences 1 and 2 are also susceptible to interpretation as in (19a and b), Le. as the 
canonical ‘control structure’ of the tree representation in (10), and they map onto it ina 
straightforward fashion. 
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19. a.  waikdura; (2; keselgedi ahulela] ate wanonowa 
monkey; { ©; banana pickup-ig] hand waves 


b.  kotiya; [Qj bas-eka peralola}] rawumak duwanawa 
tiger; 10; bus knockdown-jg] circle runs 


Note that the surface linear order of constituents here is the one that occurs commonly in the 
children's natural speech for conjunctive structures where there is an overt main subject (viz. the 
relevant examples in the natural speech table (16)). In both natural speech, and experimentally, then, 
the children appear to have strongly favored that ‘control’ mapping . eccounting for the fact that this 
category of sentences shows the highest CR] and resistance to the effects of pragmatic context.. 
Contrast this with the (‘backward right” ) sentences 7 and 8 in (17). In these, not only is the 1a’ 
clause extraposed out of the canonical control domain, but the presence of an overt nominal in the 
postposed -Ja clause blocks the control ‘la’ interpretation, and forces the absolutive one. These show 
the lowest CR] of any of the types tested (.82). The corresponding -game sentences, 5 and 16, 
showed higher CRJ (1.01). than 7 and 8, signifying once again the child's distinction between -la and - 
game clauses. 


Conclusions: In conclusion, these experimental results suggest a continuous ‘stucture-dependence’ 
in children's early hypotheses about the prolific EC's in Sinhals., including those involved in ‘control 
structures’. As we have shown, Sinhala EC's are not differentiated by surface cues such as verb 
morphology, or surface agreement. Thus, the fact that children did distinguish the interpretation of 
EC's in the full set of adverbial forms we tested appears to reflect sensitivity to the abstract 
structure of these forms. Critically, in -]g sentences 1 and 2, which were susceptible to the canonical 
control structure, children took the option for a control interpretation. They clearly did not 
assimilate these to the adjunct ~agame sentences. 

In terms of linguistic theory, the full set of results appears consistent with a “generalized 
coatrol theory” such as proposed by Huang (1989), wherein EC's are significantly differentiated by the 
configuration in which they appear: although this remains a direction for future research. In terms of 
first language acquisition, these results cohere with previous results on Sinhala acquisition derived 
through an experimental test of production (elicited imitation), Le., Lust, Gair, Goss and Rodrigo 1987. 
More generally, they cohere with results attained on English acquisition of control by Cohen Sherman 
1983, 1987, Cohen Sherman and Lust, 1988, 1986 which provide evidence for continuous structure- 
dependence in the child acquiring English contro! structures. They also cohere with results attained 
from experimental study of Japanese acquisition of adverbial structures (Lust, Wakayama, Snyder, 
Mazuka and Oshima, 1987). The crucial issue these results raise is: if it is abstract structure that is 
consulted critically by children in the differentiation of EC's, how is this knowledge determined, ic.. 
how is it mapped correctly to various surface structures? What principles does the child use to 
determine what constitutes a ‘paradigm control stucture’ as opposed to an adjunct structure? The 
experimental results reported above in Sinhala begin to identify the type of structural factors that 
were consulted in this mapping and point the direction for future research. in these areas. 


POOTNOTES 
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to be discussed ister. 
3 One frequent scare is senilar to that of the Engish verb plirese conjoining: thet do not obey the conjoined structure constrait ( Le do nt 
impose sufecency?, of the kind noted by Ross (067): “What did you go to the store and buy?’, and they commonly tracsiate ach arectures. 
There are otbar definable senses as vel, auch as tind of manner coe: sumo mire deals korea badass 1 chilies pute food mace’ | made 
the food{ by) putting in chillies’ 
4 A sentence With a dative case subject, Dut otherwise identical to (Sa) is in fact posible, as in. (ic): 
fc. mato saddoyak ails dwes. 


‘Theard a sound end ran.’ 
However, there is a clear difference in that (6) wil abow 0 direct case nominal, even though coreferent with the detive one, to 
occur as subject of the main clause. Thus (8d) is grammatical, (though uniilely except in mecial circumstances, given the general spersencss 
in pronoun use). However, (fe) where the embedded subject is spalied out, is imponibie: 


63. mate saddoyak ehis mam dw e “wan mets saddvyek ehile dwwa 
1-DAT sound-INDEF heard J4 -d ren NOM DAT sound-INDEF heerjg ran 
: Thgard a sound and 191 T heerd s sound and ren.’ 


Swe have asumed an {in SPEC Y structure, bat that does not affect our zigument. Ths crucial polnt ts thet the clause is within IX-S) and 
clearty c-commanded by the higher. co-referentisl, subject. 

6 Foe some speakers it s formed by adding it to the relativizing form besed on the participlal xem, bence dioussma, but this is = simple variant 
not effecting the distribution and does not matter for our purposes. -samng is historically 2 contracted form of bame (probably ultimately « 
Sansksit sqmaye ‘time?, and in careful speech one may stil besr_dunie ban or diou heme, 
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A Sensitive Period for the Acquisition of Complex Morphology: 
Evidence from American Sign Language 


Dennis Galvan 
of Cognitive Science 
University of California, Irvine 


Recent investigations by Mayberry, Fischer, and Hatfield (1983) and 
Newport (1984), into the use of American Sign Language (ASL) by deaf 
persons with deaf parents (native signers) and those with hearing parents 
(late signers), suggest that late signers are using signs as unanalyzed 
wholes. Newport (1984) provides striking evidence of this froma study 
conducted by Ted Supalla and herself. Results indicated that neither age, 
nor the mumber of yaars of siqning was predictive of siqnificant effects. 
In contrast, the age at which the deaf person first learned to sign 
(native, early, or late) was a significant factor. The signs of these late 
learners did not incorporate all of the required inflections. 


Newport asks why it is that older deaf children or adolescents 
learning ASL for the first time (early and late siqners), are not as adept 
at learning the morphological comlexities of ASL as are young native 
signers. After all, physiologically an older child should have mre 
available memory and better cognitive processing abilities than a younger 
child. One possibility is that an older child’s ability to see the gestalt 
of a sign may actually impair his/her ability to see the parts. In other 
words, late signers may be using the lexicon, or sign, as their basic 
linguistic unit of analysis. 


If we assume that a young infant has limited memory and/or cognitive 
processing abilities, it follows that the infant native siqner may only be 
able to comprehend components of a sign, such as its handshape or 
movement, and may be unable to retain very many features of « sign in 
mind at any one time. If Newport’s suggestion is correct, aid the late 
signers are using a holistic unit of analysis, then we shwuld see very 
different patterns of development in majer morphological subsystems for 
native and non-native signers. 


As native signers mature, the internal morphological complexity of 
verbs in ASL increases (Newport and Meier, 1985). This development can be 
expected to continue past 5 and possibly as late as 10. If the late 
signers are using a4 holistic approach, then the morphological camplexity 
of ‘he sign should remain constant acress age. If the late signers are 
using an analytical approach, we could expect the development of the 
various subsystems to parallel that of the native signers. This study 
investigated the morphological camplexity of verbs used in a narrative by 
native and late signirs deaf children. 


Method: 
To investigate this possibility, the acquisition of three independent, 
yex simultaneously produced nerphological systems in ASL were examined. 


These include; the linguistic use of space, the use of classifiers, and 
inflections for aspect. Unlike spoken languages, ASL incorporates 
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ackiitional information into a sign via simultaneously produced layers 
rather than sequentially produced units. Variations in the use of space 
(where the beginning and end points of a sign are), the use of classifiers 
(handshapes and linguistic use of the body) er in aspectual inflections 
(changes in movement) all contribute unique information to a sign. 


Thirty deaf children with severe or profound prelingual hearing losses 
and no secomdary handicapping conditions, and two deaf native signing 
adults participated in this stixy. Four groups of native signers (ages 3, 
5, 7, amd 9), and two groups of late signers (ages 5 and 9) all attended 
the California School for the Deaf in Fremont. Each group of children 
consisted of five subjects. Unfortunately, there were not enough late 
signers to include 3 and 7 year olds. All native signers have deaf parents 
and have been exposed to ASL since birth and all late signers have hearing 
parents and no other deaf family members. The late signers were first 
introduced to sane kind of sign language (usually a signed English system) 
between the ages of 2 and 4. 


The subjects were videotaped either at school or at hane. As one of 
several psycholinguistic tasks, the subjects were asked to first look 
thrugyh, then sign, the story from the picture book "Frog, Where Are You?" 
by Mercer Mayer Jew Yor;: Dial Press, 1969). They were allowed to look at 
te pictures while tellin; the story. All videotapes were transcribed and 


coded by the researcher (a hearing native signer) and an ASL researcher (a 


deaf native signer). 


Seven verbs vice chosen for this analysis. The ability to use 
morphologically complex verbs has been shown to be a reliable indicator of 
fluency in ASL (Supalla, 1982). The seven verbs chosen were same of the 
most cotimen verbs in the story and were chosen because they were used by 
all age oxoups and because they represent different kinds of verbs in ASL. 


Comparing the performance of native and late signers was difficult 
because of the small mumber of subjects (five) per group. To equalize each 
subject’s contribution to the data, scores were averaged within verbs for 
each subject. Each subject had a maximm of seven scores, one for each 
verb. This procadure also controls for individual differences in story 
length. With five subjects per cell, there was a maximm of thirty-five 
data points per group. It was not uncammon for subjects to not use one of 
the seven verbs at all in their rendition of the story. This was coded as 
an anission. Onissions were excluded from my calculations, which made 
statistical computations difficult because of unequal sample size between 
age groups and the smvll mumbers overall. Consequently, significance 
testing was not performed on these data. Instead, frequencies of 
occurrence of particular forms are outlined. 


Results and Discussion: 


In this study there are striking differences between native and late 
signers on all measures relating to the internal complexity of the 
analyzed verbs. These measures include the use of space, classifier use, 
inflections for aspect, and the general internal campiexity of a sign. 
Measures unrelated to the internal complexity of a sign do not show the 
same kind of qualitative differences. For example, a simple count of the 
rumber of signs used by the subjects reveals very similar patterns of 
development. The average mmber of signs in a story increases with age for 
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native and late signers. This measure does not take into account the 
muumber of morphemes per sign, only the total ramber of discrete signs in a 


story. 
Development of Spatial Reference: 


The four spatial inflections examined in this study include signs that 
are; uninflected, inflected to the book, inflected to an arbitrary point, 
inflected to an arbitrary point with a previous reference. If the 
development of each type of spatial inflection is paralle! for native and 
late signers, then it can be posited that overall development of spatial 
reference is also parallel. 


The course of development for uninflected forms is very interesting. 
These results, shown in Figure 1, indicate that while 3 year olds use more 
uninflected forms of the analyzed verbs than the other forms, the 
frequency of uninflected forms increases at ages 5 and 7, then decreases 
dramatically at 9. Adults rarely use the uniflected form of the verb. 


Insert Figure 1 Here 


The inflecting of a verb to a book is not mentioned in previous 
literature and therzfore may not have been examined in previous ; 
The native signing 3 year olds use this form extensively while the older 
native signers do not use it at all. In this relatively immature use of 
spatial reference, it seems that the book, and specific pictures in the 
book, are being used as a physically present object, to which these verbs 
are being inflected. 


Native signing 9 year olds are establishing spatial loci for various 
referents and are beginning to maintain those loci over several 
utterances. This is reflected in their increased mmber of verbs inflected 
to an arbitrary point both with and without previous reference. Between 
age 9 and adulthood, however, the verbs with previous reference surpass 
those without previous reference. Adults are establishing and using 
locations in the signing space on regular basis at this point. 


The development of spatial inflections for late signers shown in 
Figure 2, is very different than that for native signers. Contrary to the 
native signers, the frequency of uninflected signs for late signers 
increases between ages 5 and 9 as the frequency of signs inflected to an 
arbitrary point decreases. 


Insert Figure 2 Here 


For the 5 year old late signers the high frequency of verbs inflected 
to an arbitrary point is due to the random placement of signs in the 
signing space. This interpretation of the data is supported by the 
corresponding low frequency of verbs inflected to an arbitrary point with 
previous reference. This could be diz to chance alone. The inflection of a 
verb to an arbitrary point with a previous reference is an indication that 
the sioner is beginning to use space linguistically. There is no evidence 
that the 5 year old late signers are using the signing space 
linguistically in this narrative. This is not to claim that late signers 
at this age would not understand or use some simpler uses of space such as 
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verb agreement for present referents (e.g., I-GIVE-YOU or YOU~GIVE-ME) . 
The developmental patterns of spatial reference by native and late signers 
are viewad as evidence of qualitative differences in development. 


Tllustrations 1 and 2 are examples of a native signer and a late 
signer commenting on the same picture. The dog is jumping on the tree, the 
beehive falls and shatters followed by the bees emerging from the bechive, 
new on the ground. Illustration 1 shows a 9 yaar old native signer signing 
the sentence: 


Illustration i 


DOG-PUSH-IT SMALL-ROUND~THING-FALLS-DOWN SMALL-ROUND-THING-SHATTERS 


This sentence uses the downward direction of the falling in the second 
sign. It is very clear that a small round thing, in this case the beehive, 
is falling dom. In contrast, Illustration 2 depicts a very different 
version of the same scene. Here a 9 year old late signer signs: 


Illustration 2 


DOG BEE OPEN MANY-SMALL-THINGS-FL. 


This sentence uses no spatial reference at all. We don’t know what opened, 
why it opened, or how the act of opening is connected to the dog and the 
bees. 


Development of Classifier Use: 
A second feature of ASL acquisition analyzed here is the use of 


classifiers to mark secondary objects and grammatical relations. Again, 
there are different patterns of development for native and late signers. 
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Results shown in Figure 3 indicate an overall increase in the number of 
classifiers used by native siqners ss they mature. Late signers, however, 
actually show a decrease in the frequency of classifier use. 


Insert Figure 3 Here 


Illustrations 1 and 2 also show the differences between the analytic 
use of classifiers and the use of unanalyzed forms. The sentence shown in 
Illustration 1 contains several classifiers. In the first sign the boy 
uses his body as a body classifier for the dog pushing the beehive. The 
second sign uses a C-shapa handshape as a Size and Shape Specifier (SASS) 
classifier for a small round thing falling. The last sign also uses a SASS 
classifier and shows the beehive shattering. 


In contrast, the sentence in Illustration 2 contains no explicit 
reference to the beehive. Only the last sign could be a classifier sign 
but even this is in doubt. Earlier in this girl’s rendition of the story 
she asked the test administrator what was happening in the picture. She 


responded: 
BEE MANY~SMALL~THINGS-FLY 


It appears as if this late signer simply copied this second sign as a 
whole. In support of this interpretation is her use of the sign OPEN (open 
a box), which indicates that she is not analyzing the camponents of this 
verb. The frozen form of the verb OPEN is taken from the prototypical 
action of opening a2 small cardboard box. Each hand is a SASS classifier 
for a wide-flat flap moving from a flat closed to an upright open 
position. This subject used this form to refer to tha opening of a window 
and the opening of the beehive after it fell. The verb OPEN has presumably 
not yet been analyzed by this subject. 


Development of Aspectual Inflections: 


The overall frequencies of aspectual inflections for the different 
graups shown in Figure 5 indicate that for the native signers, 
inflections are acquired relatively early. The performance of the late 
signers is very different than that of the native signers. Late signers do 
not seem to be paying attention to the variations in the sign’s movenent, 
or if they are, they are not incorporating that variation. The sharp drop 
in frequency of aspectual inflections by 9 year old late signers would 
again indicate a qualitative difference in linguistic processing between 
the native and late signers. 


. Insert Figure 4 Here 
Development of the Internal Complexity of a Sign: 


The development of the three morphological subsystems can be better 
understood if we consider how they are integrated in the sign itself. The 
morphemes included in this measure of internal complexity are: 1) the verb 
itself, 2) any classifiers used to mark a subject, cbject, or grammatical 
relation, ard 3) any aspectual inflections articulated simultaneously with 
the verb. Figure 5 shows how the number of morphemes within a verb 
increases for the native signers, but net for the late signers. 
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Insert Figure 5 Hare 
Devalopment of Contextual Complexity: 


It is assumed that the complexity of the utterance is a reflection of 
cognitive development. The sentence in which the analyzed verb occurred 
was coded as: 1) a label (FALL), 2) a description (BOY FALL or FALL 
WATER), 3) a relationship (BOY FALL WATER or JAR-OVER-HEAD), or 4) a 
causal relationship (DEER STOP, BOY FAIL). Figure 6 reveals paraliel 
patterns of development for both groups with late signers exhibiting an 
approximate two year delay. 


Insert Figure 6 Here 
Conclusions 


This pattern of development indicates that native signers use the 
morpheme as their basic linguistic unit of analysis, while late signers 
use the lexical item itself. This conclusion is supported by the finding 
that the mmber of morphemes per sign increases with age for the native 
signers, but not for the late signers. This difference between native and 
lata signers does not exist at the sentence level. Both native ard late 
signers use increasingly complex sentences as they mature. The interesting 
finding here is that late signers show cognitive develogment without 
morphological development. 


These findings support Newport’s claims that late signers treat signs 
as gestalts, rather than as independent, simultaneously produced systems. 
Late signers, being first exposed to signs when they are cognitively more 
developed, are able to see the sign as a whole. They then use the sign, 
rather than the morpheme, as their basic linguistic unit of analysis, thus 


impairing later morphological development. 
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THE FUNCTION OF THE OBLIGATORY CONTOUR PRINCIPLE IN ENGLISH: 
EVIDENCE FROM CHILD LANGUAGE* 


Heather Goad 
University of Southern California 


1. Introduction. 


morphemes ; 
acalisition of inflectional allomorphy. In particular, I focus on the 


Obligatory Cortour Principle (OCP). My goal is to determine how the OCP 
operates in Emylish, both in acquisition and in adult grammar. 

I attempt to account for the stages in the acquisition of the [ez] 
allomorph as well as for the late appearance of [az] vis-4-vis the other 
plural allamerphs. Most of the discussion centres on a type of error 
which I term ‘gemination' where the child simply adds (s] or [z] directly 
to the root, yielding {horss} and [{rozz), for instance, as the plurals of 
horse and rose respectively. I suggest a parametric account of the 
operation of the OCP to explain both gemination and the other stages in 
the acquisition of the fez) alloamorph. 


2. Plural allomorphy in English. 

The English plural has three phonologically comditioned allomorphs, a 
syllabic allamorph, [az], and two nonsyllabic allomorphs, [2] amd [s]. 
Their distribution is as in (1). 


(1) Distribution of English plural allanorphs 
[z] after sonorants: e.g. dane, tail, shoe 
[z] after voiced nonsibilant obstruents: e.g. bed, dog, sieve 
{s] after voiceless nonsibilant obstruents: e.g. cap, lake, bath 
{az} after sibilants: e.g. horse, dish, judge 


Crucially, the syllabic allamorph (3z] appears after roots ending in 
sibilants. The OC? is responsible for the intrusive [9}; it functions to 
break up sequences of (nearly) identical segments which would otherwise 
end up adjacent to one another. 


3. The Obligatory Contour Principle. 

I will briefly discuss those aspects of the phonological theory I am 
assuming which are relevant to the formulation of the ocP. In current 
nonlinear theory, phonological representations are muitiplarar. 
(actually bundles of hierarchically organized features) are linked to a 
central core consisting of skeletal slots. The slots, noted by Xs, 
indicate the mmber of segments in the representation. See (2). 
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(2) »mlimwar representation of stitch 
xXxxXX X skeletal tier 


\ 
Vila) segmental/melodic tier 


aAffricates are represented as two segmental matrices linked to one 

skeletal slot (Clements & Keyser 1983). In (2), [tS] consists of one 
matrix which is identical to that for [t] and another which is 

identical to that for [5]. It is thus not surprising that [t3) and [dz] 
pattern with [8] and [Z] in the plural. 

The ocP is given in (3). It disallows sequences of identical 
elements on the segnental tier where ‘element’ refers to whole segments 
or features, C~pending om the language, It is claimed to be universal. 


(3) Obligatory Contour Principle (McCarthy 1986:208) 
At the melodic level, adjacent identical elements are prohibited. 


Languages may allow adjacent identical elements, but the OCP 
that they share melodic structure at the point where the violation would 
cccur. For exanple, if a language allows geminates, the OCP 
that al] melodic material be shared. The representation must be as in 
(4)A and rot as in (4)B. 


(4)A. Licit representation of pp B. Tllicit representation of pp 
xX xX *X X 
\/ | | 
Pp P P 
The OcP is not simply representational in nature. For instance, it 
has long been known that geminates cannot be split by rules of epenthesis 
(Kenstowicz & Pyle 1973). ‘The OCP disallows epenthesis into geminates as 
the association lines between the skeletal tier and the segmental tier 
would necessarily cross; this is prohibited by the theory of nonlinear 
phonology (Goldsmith 1976, inter alia). See (5). 


(5) is between 
#X XX or *X XX 
\ X/ 


pe ap 


representation of the bimorphemic sequence CVC;~-C; is as in (6). 


- #3 : : p . - gt 
a. pees 
7 a. ‘ 3 , he SS 


alee 
Iz. 


Li? 


»,4 

| 

Cc 
Although the final consonants in (6) are identical, they are not adjacent 
at this stage in the derivation and so are not yet subject to the OCP. 
ee 2 ere ee ee ee 


cn. 
In languages which allow geminates, in ccnformity with the OCP, the 
two identical segmental melodies are automatically merged as a result of 
plane conflation. The post plane-conflation representation of Ci,-C; is 
then as in (7). ‘What started out as a bimorphemic form in (6) is tow 
le from the corresponding monomorphemic forn, cve,C,. 


(7) Post plane-conflation representation 
XX XX 


fe) rormmmen 
< 
2 


In languages which do not allow geminates, how the OCP operates 
across morpheme boundaries is a contentious issue. Some researchers 
(e.g. McCarthy 1986; see also Borowsky 1987) believe the OCP functions to 


/az/ as in (8)A. The {9} deletes everywhere except where an OCP 
violation would occur. Thus, [3] deletes in roads but not in roses. In 
the triggering analysis in (8)B, the underlying representation of the 
plural morpheme is assumed to be /2/; epenthesis cccurs whenever there is 
an OCP violation. ‘Thus, epenthesis occurs in roses but not in roads. 


(8)A. OCP as blocker B. OCP as trigger 
[roz-az/  /rod~az/ {Yo2z-zf /reda-2/ 
< Mi [a] epen 1 ae 


Fs) r) 
{rozaz)} {rodzj (rozaz}_ = [redz} 


I will arque below that the acquisition dsta lead to the conclusion 
that in English the OCP operates to trigger [a] epenthesis rather than to 
block [3] deletion. 


{a} del 
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4. Acepisition of the (32) allomorph of the plural. 

Moch research has focussed on the acquisition of the plural beginning 
with Barko's (1958) seminal paper on the subject. She found that the 
(az) allomorph is the most difficult for children. ‘This finding has been 
confirmed by many other researchers, both for data collected naturally 
and for data collected y (e.g. Anisfeld & Tucker 1967, 

& Anisfeld 1969, Brown 1973, Ervin 1964, Miller & Ervin 1964, 
Hecht 1983). 

A morphological explanation has often been provided for the late 
acquisition of [az] (e.g. Berko 1958, Baker & Derwing 1982, see also 
Bybee & Slobin 1982). ‘The child initially assumes sibilant-final roots 
are inflected and, as a result, adds no further inflection. 
Berko states (p. 173): "a final sibilant makes a word plural". Despite 
the fac*. that this analysis intuitively seams correct, it is problematic 
for two reasons. One, the forma! implication is that sibilant-final 
roots saab ‘wo allamorphs. Rose, for instance, would have one allomorph 


another strategy, gemina 
mentioned in section 1, in gemination, the child adds [{s} or {z] directly 
to the root, yielding (rozz] as the plural of rose, for instance. I use 


palatal sibilants) . 

Although gemination does not appear to be very common, the fact that 
it exists mist be accounted for because it results in a violation of 
English phonotactic constraints; English does not allow tauto-syllabic 
lang consonants. Actually, gemination may be more common than has been 

i Some examples may have bean misanalysed as deletion since 
English speakers tend to hear identical consonant clusters as short. 

T have extracted examples of gemination from Berko (1958) and Hecht 
1983). The sibilant-final roots used in the two studies are provided in 
(9); those which underwent gemination are indicated. 


(9) Examples of gemination strategy 
A. Berko (1958). 
Subjects: 80 children, 4-7 years of age 
Sibilant-final nonsense words: tasg [(taes] nizz [ntz] 
; q@utch (gatS] kazh [kaezZ] 
Sibilant-final real words 


: glass 
Examples of gemination: 10% of subjects - plural of gutch 
5% of subjects - plural of kazh 


ae 


ae 
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B. Hecht (1983). 
Subjects: 42 children, 2710-5710 years of age 
Sibilant-final nonsense words: dags [{daes} foog [fuz] 

yasb (m5) gutch [gats) 
Sibilant-final real words: bus rose bush wate) 
Examples of gemination: 10% of subjects - plural of :qitch 
2% of subjects -~- plural cf watch 


cone ve-year old), 
gemination may be the first stage in the acquisition of [92]. Deletion 
would then be the second stage. This order of acquisition can only be 
the cross-sectional data; Phagiear ad a longitidinal study 
order. 


5. Phonological account of the late acquisiton of [az]. 
I provide a phonological explanation for the late acquisition of the 
[az] allomorph, one that is rooted in the operation of the OCP. Racall 


McCarthy 
constraint. It is crucial for his analysis that the underlying 
tion of the plural morpheme be foz/. If it were simply /z/, he 
would incorrectly predict that that the plural of rose was [rozz), as 
illustrated in (10). 


(10) Derivation of *{rozz} a la McCarthy 
A. Pre plane-conflation B. Post plane-conflation 


For McCarthy, there can be no intermediate stage (11) between (10)A 
and (10)B because this would violate the OCP. The form in (11) has 
undergone plane conflation but each [2] still has its own melody. 


(11) Stage intermediate between (10)A and B 
aX X 


N-—— 
NN 3G 
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If (11) ware allowed, epenthesis could apply at this stage yielding 
the correct [rozez]. Epenthesis cannot take place after the planes are 
conflaced in (10)B because this would result in a violation of the line 
crossing convention as in (12). 


(12) Epenthesis after plane conflation 
* 


aX XXX X XXXXX 
|| \X o | | X/ 
ro z3 roez 


Yip £1988) argues for a weakened version of the OCP by allowing for 
the intermediate stage in (11). In fact, she claims that this i11~formed 
representation is exactly what triggers rules like (9) epenthesis. For 
her, thon, the underlying representation of the plural morpheme is /z/. 
The derivation of roses proceeds as in (13). 


(13) Derivation of [rozaz] & la Yip 
A. Pre conflation B. Post conflation Cc. Epenthesis 


Yip views the OCP as an active constraint whose function 
tion as ill-formed, thereby requiring that it be repaired. 


cbetruent-final stems and [z] to sonorant-final stems. Two, if the 

underlying representation were /az/, we would expect the syllabic 

allomorphs to be acquired first since no rules need apply to derive the 

surface representations. Three, as the presence of {a} Kreme up two 
» might 


1? 


a 
. Iz. 
7 CY 


Recall that if the underlying representation is /9z/, the [3] must delete 
everywhere except if an OCP violation were to result. Nonalternating 


syllabic-nonsyllabic alternations exhibited by the plural. However, 
their vowels never delete as shown in (14) for the superlative. 


(14) Superlative inflection 
biagest —> [bigest], *(bigzd) ripest —> (raipest), *{raipst] 
thinnest —> [@inast], *[@:nzd] nicest —> [naisast] 


This indicates that the underlying representation of the plural mst be a 
single consonant only, /2/. Additional evidence that the plural and 


syllabicity) is that children never ize deletion to the 
ungrammatical forms in (14), yielding *[btgzd] for instance. 
The gemination strategy is once the constraint 


assumption about the operation of the OCP as a passive constraint, the 
child opts for deletion. The plural [(s/z] is not syllabifiable after 
sibilant-final roots due to the constraint and is therefore subject to 


Ultimately, the child must acquire the adult representation. ‘The 
deletion stage persists for same time. I suggest that this is because, 
to maintain the assumption that the OCP operates as a passive constraint, 
the child may only reach the adult stage if s/he postulates /e2/ as 
underlying. However, as mentioned earlier, there is evidence against 
this. The only other alternative is for the child to assume the OCP 
operates as an active constraint. ‘Thus, as a result of Plane conflation, 
the two segnental matrices are not inmediately merged; instead, each 
hr rl ge ore i eect The OCP operates at this stage to 


parametrically determined. Note that the OCP’s status as a universal 
constraint on representations is unaffected; only its operation, as a 
passive constraint or as an active constraint, must. be determined on a 


in the form of [9] epenthesis. 
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Note. 
*I would like to thank Pat Clancy, Te a Joseph 
Stenberger and the participants at at the 1989 Stanford Child Language 
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WH-QUESTIONS AND EXTRACTION FROM TEMPORAL ADJUNCTS: 
A CASE FOR MOVEMENT 


Helen Goodluck, Julie Sedivy and Michele Foley 
University of Ottawa 


This paper reports the results of an experiment testing children's 
knowledge of the constraint thet prevents questioning from a position inside 
a temporal adjunct, i.e. knowledge of the ungrammaticality of questions such 
as, 

1. *Who did Fred kiss Sue before hugging _? 

We find that children as young as three years show awareness of the 
constraint. We interpret these results as evidence that children form 
questions in English by movement (by an operation linking deep structure to 
surface structure) . 


I. Islands and movement 


Languages are widely agreed to differ in whether they have movement “in 
the syntax", linking deep and surface structures. A standard diagnostic 
of a language with movement in the syntax is obedience to structural 
constraints. Languages which form questions and other sentence types by 
dislocating an element from its underlying position tend to obey "island" 
conditions, limiting the positions from which an element may be moved. 
The block on movement from within a temporal clause is one such structural 
condition. Languages such as inglish, which form questions by dislocating 
the question word to the front of the sentence obey the constraint 
exemplified in (1); languages such as Chinese and Japanese, which do not 
form questions by dislocation of a question word, allow questions (including 
the equivalent of (1)) that are ungrammatical in English-type languages. 
(For pertinent discussion, see, for example, Huang 1982). We chose the 
temporal island to extraction as a focus of our test because it appears to 
be a very good indicator of language type. That is, unlike some other 
islands (for example complex NPs), it seems to be relatively immune to 
exceptions -- languages which form questions by dislocation of elements from 
their underlying position obey the constraint, those that do not form 
questions in that way allow questions equivalent to (1). The temporal 
island constraint is also explicable in terms of more general principles of 
grammar that putatively govern movement rules (specifically, subjacency), 
although this is not unproblematic (for recent discussion, see Chomsky 1986; 
Lightfoot and Weinberg, 1988). 


II. The experiment! 
IIl.1 Materials, subjects and procedure 


Our experiment uses a picture-cued question-response task of the kind 
used by deVilliers, Roeper and Vainikka (1988). The child listens te 
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"stories" about animals; each story consists of a sequence of four 
sentences, each accompanied by a picture. While looking at the Last picture 
the child is asked a potentially ambiguous question, designed to probe his 
knowledge of syntactic conditions on movement. 

There are three conditions in the experiment, with three stories for 
each condition. A sample story from each condition is given in (2A-C). 


2. A -- Temporal adjunct island 
Story: The fox ran down to the river 
First he ate an ice-cream cone 
Then he whistled a tune he'd heard on the radio 
The fox felt pretty happy. 
Question: What did the fox eat before whistling? 
Potential answers -- Upstairs: icecream 
-- Downstairs: a tune 


B -- ask plus complement in the verb phrase 
Story: The zebra was feeling happy 
He just wanted to hug and kiss everyone 
The zebra asked the lion: "Shall we kiss the 
monkey?" 
The zebra was a kind animal 
Question: Who did the zebra ask to kiss? 
Potential answers -- Upstairs: the lion 
-- Downstairs: the monkey 


C -- ask plus temporal adjunct island 
Story: The elephant liked to work 

She asked the tiger: “Shall I help the horse carry 

those boxes?" 

The tiger said "Yes!", so the elephant helped the 

horse. 

The elephant was tired at the end of it all. 
Question: Who did the elephant ask before helping? 
Potential Answers -- Upstairs: the tiger 

-- Downstairs: the horse 


For all three conditions (A-C), the verbs in both the main and 
subordinate clause ere optionally transitive. As a result each question 
has two logically possible answers - an “upstairs” answer, where the 
question word is taken to refer to the object position of the main clause, 
and a "downstairs" answer, where the question word is taken to refer to 
the object position of the embedded clause. In conditions A and C, where 
the embedded clause is a temporal adjunct, only the upstairs option is 
permitted in the adult grammar of English.® In condition B, where the 
embedded clause is a complement to the main clause verb phrase, the question 
is genuinely ambiguous in the adult grammar, and either upstairs or 
downstairs answers are permitted. 

Our test of knowledge of the temporal island condition involves 
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evaluating children's perforwance across the three conditions. Under the 
temporal island condition, we expect only upstairs answers to conditions A 
and C. Condition B gives us a control that preference for upstairs answers 
to conditions A and C is not simply a reflex of a preference for making the 
question word refer to a position inside a main as opposed to subordinate 
clause. In combination, conditions A, B and C give us some measure of the 
generality of the constraint--i.e whether or not apparent obedience to the 
constraint can be explained in terms of lexical preferences (preferences for 
particular verbs being construed as transitive). The adjunct island 
conditions A and C vary the type of predicates involved (two action verbs 
such as egt for A and one action verb and the verb ask for C). Since the 
same predicate types were used in both B and C, any difference in amount of 
downstairs answers for B vs. C is unlikely to be due to lexical preferences 
concerning whether particular verbs are transitive.? As an additional 
control against lexically-dependent results, half the subjects received 
materials with temporal adjuncts introduced by the preposition before and 
half received materials with the preposition after; thus for half the 
subjects the probe questions for the examples of conditions A and C were as 
given in (2) and for the other half the questions were, respectively, “What 
did the fox whistle after eating?" and "Who did the elephant help after 
asking?"). 

Thirty 3-5 year old children were tested (10 threes, 10 fours and 10 
fives). The pictures for each story were bound in a 8 1/2 x 11 1/2 inch 
folder; the experimenter (JS) turned the pages of the folder for the child 
as she read the story to the child. The folders were shuffled to produce 
an individual order of presentation for each child, with the constraint that 
no runs of more than two stories in any condition were permitted. If a 
child did not respond on first presentation of the story, the folder was 
returned to at the end of the experiment for a second (and, if necessary, 
third) trial. Sessions were tape-recorded. 


II.2 Results 


Table 1 gives the results of the experiment in terms of the percentage 
upstairs, downstairs and "other" responses for the three conditions, by age 
group. We accepted as upstairs or downstairs answers responses that were 
not completely faithful to the content of the story, but which were 
plausibly an answer toe one but not the other of the two predicates in the 
question (for example, the answer “a song" would be scored as a downsrairs 
answer to the question for the story in 2A). In all, there were 26 non- 
exact answers of this type (9.5% of the total data, including failures to 
respond) that we accepted as upstairs or downstairs answers. 

The percentage figures for conditions A and C (Table 1) show that by 
age five years children are very clear-cut in their obedience to the 
temporal island constraint; moreover, the results for condition 5 indicate 
that this is not simply the result of a preference for extraction from the 
main clause. The results for the three and four year olds are less clear- 
cut, but nonetheless show the same pattern as that for five year olds. As 
the figures in Table 1 show, the main development over age was for Other 
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responses to be replaced by upstairs responses, for all three conditions. 
There was a significant main effect of age for upstairs answers 
(F(2,27)~9.65, p < .001) but not for downstairs amswers (F(2,27)=1.72, p 
< .20). Whether the child received materials with the preposition before 
or after had no significant effect of proportion of correct (upstairs) 
responses for either condition A or C (there was a very small trend in favor 
of more correct responses with before). 


TABLE 1 
Percentage Responses 
by Age 
Condition A Condition B Condition Cc 
U D OT U D OT U D OT 
3 yrs 50 13 36 37 43 20 47 20 33 
4 yrs 60 7 33 23 «53 23 70 7 23 
5 yrs 90 3 7 53 40 7 100 0 9] 


~weoeoneoee ee ee ee aa 


U = Upstairs; D = Downstairs; OT = Other 


Table 2 summarizes the distribution of children in terms of individual 
response patterns over the three conditions. We took as criterion for 
knowledge of the constraint a response pattern with a least one downstairs 
response for condition B and beth a greater number of upstairs responses for 
condition A than for condition B and a greater number of upstairs responses 
for condition C than for condition B. Of the 20 possible patterns of 
responses that were "pass" patterns by our criterion, 13 were exemplified 
among the 19 children who met criterion. 


Table 2 
Number of Children with 
"Pass" Response Patterns 


3 years 4 
4 years 7 
5 years 8 


eeoew®eaenewrennereeceeerrewerer ee ene ene 


We wish to argue that our results indicate that by age three years 
children obey the adjunct island condition. By the logic that takes 
obedience to the constraint as a diagnostic of movement in the syntax, 
children at this age are forming questions by a movement operation of the 
same type as that in the adult grammar of English.® 


TIT. Discussion 
The development of movement rules has recently become a hot topic in 


language acquisition studies. A series of questions and proposals have 
come out of the literature of the last three or four years, including: 
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1. At @ young age (say, before four years) chiidren learning English- 
type languages may lack movement (Roeper 1986; deVilliers, Roeper and 
Vainikka 1988). 

2. Within languages with movement as an operation between deep 
structure and surface structure, movement may develop at a different pace 
in different languages and different constructions (Weissenborn 1988; 
Labelle 1988; Goodluck and Behne 1988; Wexler 1988, 1989). 

3. Constraints on movement may be violated (Wilson end Peters 1988). 

We will not attempt to discuss all of this literature, but we need to 
address a basic question. We have argued that children as young as three 
obey the temporal adjunct island constraint and from that we have concluded 
that children at that age have established movement between deep structure 
and surface structure as an operation in their grammars. Now can this be 
reconciied with the claims above, particularly claim (1) (that children may 
lack movement) and claim (3) (constraints may be violated)? 

Roeper's (1986) claim that children may lack movement was based on 
the fact that some children aged 5-7 interpreted who and the pronoun he as 
coreferential in an experiment that tested strong cross-over sentences such 
as, 

3. *Who, does he, think has a hat? 

The ungrammaticality of such sentences (on the coreferential reading) is 
generally taken to result from a violation of principle C of Chomsky's 
binding theory (see Lasnik and Uriagereka, 1988 fer summary and discussion) ; 
that is, from the ungrammaticality of binding the trace of who with the NP 
he. If children do not have movement, and hence do not have traces, 
sentences such as (3), might be permitted without any violation of 
grammatical principles; Roeper proposes that an unpronounced pronominal 
element (pro) may occupy the position of trace in the child's grammar. 

We do not have any neat solution to Roeper's data, but we believ: the 
weight of the evidence supports movement (and traces/bound variables). In 
addition to our results, in one experiment Roeper found that children who 
gave answers in violation of the cross-over constraint also gave multiple 
answers to questions such as (3), an interpretation that (as Roeper notes) 
will surely require a variable in the linguistic representation. Possibly 
the errors made by the children in Roeper's experiments were some kind of 
performance error. (C. McKee informs us of pilot data from an experiment she 
is conducting with D. McDaniel that suggests children's errors with strong 
cross-over are performance-based).‘’ 

Wilson and Peters 1988 report the case of a three year old child who 
went though a period of moving a noun to the front of the sentence, 
stranding in some cases determiner material (4a), and in others leaving 
part of a compound word behind (4b), 

4.a What did I get lost at the, Dad? 

(cf. I got lost at the store) 
b What are you cooking pan? 
(cf. You are cooking pancakes) 
Taking island constraints as diagnostic of movement, these examples are 
not in fact evidence against movement, but evidence of an incorrrect 
conception of what gets moved. As Wilson and Peters observe, errors such 
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as those in (4) will result if the child is not aware that maximal 
projections, rather than heads, are what move. This does not, however, 
really lessen the oddity cf the errors in (4). It is tempting to draw 
comparison with various types of incorporation processes in many languages, 
whereby nouns may be incorporated into verbs, stranding determiners, etc. 
(see Baker 1988 and references therein for many examples). However, the 
analogy is no more than an analogy. Incorporation is 4 
lowering/morphological operation. The movement in (4) involves raising, 
presumably into Comp; no English-type languages we know of permit such 
questions and possibly such questions are ungrammatical in any language. 


IV. Summary and conclusion 


By the test of obedience to an island constraint, we have argued that 
children learning English have movement in the syntax as early as three 
years. Thus children aged three have formed a grammar that is similar in 
a fundamental way to the adult grammar they are exposed to. This does not 
mean that they have nothing left to learn. Data such as Wilson and Peters’ 
presents an interesting challenge for a common assumption in language 
acquisition studies -- that is, that the child's developing grammar should 
always fall within the range of language systems exemplified in adult 
grammars. Children should not have "rogue grammars", to borrow a term from 
Finer (1989). Whether such a constraint on child grammars can be sustained 
in its strongest form in the face of data such as Wilson and Peters’ seems 
to us an interesting question to pursue. 
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Footnotes 


1. Ours is not the first attempt to establish children's sensitivity 
to islands, particularly subjacency. Several studies have looked at 
children's sensitivity to constraints on extraction from NPs, with results 
that are largely ambiguous cr indeterminate (see Otsu 1981; Crain and Fodor 
1984: Goodluck, to appear). Perhaps this reflects the fact that the 
constraints tested in those studies are subject to a degree of cross- 
linguistic and even ind2 vidual-speaker variation even within languages with 
movement. Such variation suggests that the superficial form of the 
constructions involved may be amenable to more than one analysis, and thus 
present a special challenge for the learner who has to figure out which is 
the correct analysis for the language/dialect s/he is learning. 

2. A complete set of materials is available on request. 

3. The temporal complement structure in our materials is ambiguous 
between a verbal and nominal gerund structure. This ambiguity does not 
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affect the basic point we make concerning islands and movement, although 
the details of structure that the children are reacting to remains to a 
degree indeterminate in a way that may have substantial consequences for 
understanding other areas of development (see Wexler 1989). 

&. In fact, condition B contained two stories with the matrix verb 
ask and one with the matrix verb want. Thus upstairs answers for one token 
in the condition involved extraction from subordinate subject position, 
rather than matrix object position. We included one token with want since 
we were initially not confident that ask would produce encugh lower object 
extractions to provide the control we needed (see below); deVilliers, 
Roeper and Vainikka (1988) report 70% of responses to questions exactly of 
the form in 2B involved taking the question word as main clause object. We 
obtain a somewhat higher proportion of lower-object interpretations with 
condition B (s.e below). Each token contributed to the total lower object 
extractions, with one token with ask contributing somewhat less than the 
other token with ask or that with want. 

5. We were voarticularly concerned that in conditions A/C children 
might jump at the nain clause object position as a location for the wh- 
word, without listening to the whole question (i.e. they would plug the 
wh-word in as object of the main clause as soon as the matrix verb had been 
input). Literature on sentence processing from Fedor (1978) onwards argues 
that this is a real possibility and it may be the source of some correct 
answers for conditions A/C, although it plainly cannot account for any 
difference we find between B and C. One respense was categorised as 
"Other" because the child clearly jumped the gun and responded before the 
experimenter had finished the question. 

6. We leave open here whether those errors that do occur for 
Conditions A and C can be dismissed as performance errors; an error on 
these conditions is always compatible with a recency effect (responding to 
the last-mentioned predicate in the test question). We included a follow- 
up question to condition C, to test shether errors correlated with generally 
poorer recall of the story. Due to administrative error, we do not have 
complete data for the follow-up and can draw no firm conclusions from it. 

7. In a longer version of this paper (ms. April 1989) we sketch a 
combined competence/performance account of such errors. 

8. DeVilliers, Roeper and Vainikka (1988) discuss other data in the 
context of a no-movement account of early grammars. For reasons of space 
we will not discuss their arguments here, although we believe their data 
is accountable for under a grammar with movement. Roeper and deVilliers 
(1989) present some data on extraction from small clause vs. nominalization 
structures for 4-6 year olds that they suggest is evidence for movement. 

9. A less radical analogy might be between movement of N and V 
raising, as in some analyses of the verbal system of French and other 
languages (for example, Follock, 1987). 
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THE SONORITY CYCLE IN THE ACQUISITION OF PHONOLOGY 
Gita Martohardjono 
Cornell University 
0. Introduction 
It is well-known that children use syllable structure as a unit of organization from the 
earliest stages of pangueee development. In this paper we show that the development of 
syllables over time follows a constraint which been observed to hold on syllable 
structure cross-linguistically. We suggest that this can more generally be viewed as the 
result of a constraint on the mental representation of language, i.e. a principle of Universal 
Grammar, as proposed by Chomsky (e.g. 1986). In section 1., we introduce a recently 
Constraint on Syllable Structure, namely the Sonority Cycle (Clements to appear, 
enceforth SC) and illustrate how it accounts for syllabic structure across languages. In 
section 2. we show how the SC makes several predictions for child language acquisition, 
when viewed as a principle of Universal Grammar. In section 3. we present some data 
from the literature as well as from an ongoing longitudinal study to suggest that the 
predictions for acquisition are borne out, thus providing preliminary support for the 
postulation of the Sonority Cycle as a constraint on the mental representation of language. 


1. The Theory: The Sonority Cycle 

In phonological theory, sonority, defined as a scalar feature distinguishing various classes 
of segments, was conceived primarily to explain preferred patterns of syllable structure that 
have been observed cross-linguistically. The Sonority Sequencing Principle, originally 
found in the work of Sievers (1881) and Jespersen (1904), states that within the syllable, 
segments should increase in sonority as one proceeds from the ins to the peak. The 
constraint we consider in this paper is a reformulation of the Sonority Sequencing 
Principle, namely the Sonority Cycle proposed recently in Clements 1988. It states that the 
preferred syllable type rises sharply in sonority at the beginning, but drops gradually 

1, 


toward the end giving the nattern illustrated in 


l. The Sonority Cycle (Clements 1988) 
",..the sonority profile of the preferred syllable type rises sharply at 
the beginning and drops slowly toward the end.” 


In this section we give a brief overview of the definitions and principles underlying the 
Sonority Cycle. Clements proposes that the sonority rank for each class of segments be 
derived from a set of binary features and measured in terms of the sum of the (+] 
specifications for each feature. He th.s arrives at the classification in 2. whereby 
obstruents rank lower in sonority than nasals, nasals rank lower then liquids, liquids rank 
lower than glides and glides rank lower than vowels. 


2. Major Class Features in the Definition of Sonority (Clements 1988) 


O< N<« L<« GeV 

E - + syllabic 

. : ~ + + vocoid 

i - + + + approximant 

7 + + + + sonorant 

0 i 2 3 4 rank (relative sonority) 
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O = obstruents, N = nasals, L = liquids, G = Glides, V = Vowels 


1.1. The demisyliable 
In Clements’ , the basic unit for which sonority is measured is not the syllable 
itself, but the demi-syllable, i.c. initial and final demisyllables as defined in 3. below. 
Here, the syllable is divided into two overlapping parts, sharing the nucleus. 


3. " A demisyllable is a maximal sequence of tautosyllabic segments 
of the form Cm ...CnV or VCmm...Cn, where n> m > 0." 


Thus, in the closed syllable mat , the initial demisyllable consists of the sequence ma while 
the final demisyllable is formed by the sequence az. Using the demisyllable as opposed to 
the syllable allows for a differentiation in the definition of optimality for onsets on the one 
hand and codas on the other, a phenomenon that has been attested cross-linguistically (cf. 
Greenberg 1978). Each demisyllable type is assigned a value D, measured in terms of the 
dispersion in sonority within it as seen in 4. 


4. D values for Initial and Final Demisyllable Types 


Dp fF! D 
OV VO = 0.6 
NV VN = ld 
LV WL = rs) 
GV VG = 1.00 


The difference in optimality between initial and final demisyllables is formalized in the 
Dispersion Principie which states that the preferred initial demisyllable minimizes D, while 
the preferred final demisyllable maximizes D. Demisyllable types can now be ranked for 
optimality or conversely, for complexity, in terms of their sonority profiles, resulting in the 
ranking iaustrated in 5. 


5. aa ial Rankings for Initial and Final Demisyllable Types 
DB. Ce FD 


Cc 
OV i VG 1 
NV 2 VL 2 
LV 3 VN 3 
GV 4 VO 4 
ID = Initial Demisyllable FD = Final Demisyllable 
C = Complexity 


Here, the numbers indicate relative complexity, with the lowest number being the least 
complex (hence optimal) demisyllable type. The complexity measure is extended to one- 
member demisyllables ( i.e. consisting of one segment only): an initial demisyllable 
consisting of a vowel (e.g. in the syllable am, the initial demisyllable is the vowel @ ) is 
assigned the complexity measure 5; a final demisyllable consisting of a vowel (e.g. in the 
syllable ma the final demisyllable is a ) is assigned 0. 
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1.2. Cross-linguistic Preferences 
Several cross-linguistically observed phenomena can now be accounted for directly by the 
Sonority Cycle. In this paper we will focus on three: 


i. The CV syllable is the least lex, hence “unmarked” syllable type. 

If the complexity measure of a syllable is defined as the sum of the complexity measures 
of its initial demisyllable and its final demisyllable, it becomes possible to rank syllable 
types by the Sonority Cycle. In particular, the SC predicts that any syllable with an onset 
and without a coda, for example pa , has a lower complexity measure than any syllable 
without an onset and with a coda, for example ap. This can be seen by comparing the 
complexity scores in 5. above, for the most complex open syllable type to that for the least 
complex closed syllable type (see 6.) 


6. The complexity measure for any given syllable is the sum 
of the complexity measures of its id and its fd. 


The most complex CV is less complex than the least complex VC : 
most complex CV: GV = GV (4) + V(1)=5 
least complex VC: VG= V (5) + VG (1) =6 


Thus, the Sonority Cycle predicts the CV syllable to be the least complex syllable type. 


ii. The Sonority Cycle accounts directly for what has been termed the Maximal Onset 
Principle. This principle requires that the sequence VCV be syllabified as V.CV, 
rather than VC.V This can again be seen by a comparison of rankings for initial 
demisyllables and final demisyllables in 5. 


iii. The Sonority Cycle accounts for the Syllable Contact Law (Hooper 1972, Murray and 
Venneman 1983), which states that the preferred contact between two consecutive sy!iab.es 
is one in which the end of the first syllable is higher in sonority than the beginning of the 
second, thus showing a decline in sonority transsyllabically. 


2. The SC as a Principle of Universal Grammar 

As mentioned above, sonority was conceived primarily to explain preferred syllable types 
cross-linguistically. What is the relevance of the Sonority Cycle to acquisition? Clements 
proposes the Sonority Cycle as a universal (rather than a language-specific) principle, and 
imputes it to the implicit (rather than “conscious") knowledge of ers. It can thus be 
conceived of as 8 principle of Universul Grammar in the sense of Chomsky (e.g. 1986). . 
UG principles have in recent linguistic theory been defined as constraints on the mental 
representation of linguistic units. Extending this definition to the Sonority Cycle, it could 
be viewed as an initial constraint on possible syllable structure. This constraint would then 
subsequently be relaxed as the child is presented with exmaples violating it, that is, by the 
sheppard evidence that the language allows syllable types which diverge 
from the optimal type 4s specified by the Sonority Cycle. Under this view, the SC makes 
several verifiable empirical prediction for language acquisition. Extending the 
generalizations stated in i - iii above to acquisition it is predicted that phonological 
development should be guided by the following principles: 
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CV syllables should before VC syllables 
(b) a VCV sequence will syllabified as V. CV rather than VC. V 
the 


(c) preferred contact between two consecutive syllables is one 
where the end of the first syllable is higher in sonority 
than the beginning of the 
3. The Data 


In this section we will present some data supporting the above predictions. 


3.1. Primecy of the CV syllable 
The first prediction can be translated vo state that the CV syllable is the unmarked one, i.e. 
the one the child will start with, since it represents the optmal syllable type as defined by 
the Sonority Cycle. The literature on sara aan. widely attests a marked preference 
for CV syllables which extends from the ling period through early i speech, 
to later stages in acquisition when language-specific rule learning is well in progress (4-5 
yts). This preference has been noted in cross-sectional as well as longitudinal studies, in 
experimental studies observing larger groups of children (Winitz and Irwin 1959, Stoel- 
Gammon 1985, Ingram 1974) and in the classical diary studies of single children (cf. 
Leopold 1947, Velten 1943, Smith 1982 etc.}. Furthermore, it has been observed in 
children of many different language backgrounds (cf. Locke 1983, Jakobson 1968), and, 
somewhat surprisingly, in the babbling of deaf children (cf. Syken 1940). 


3.1.1. CV in the Babbling Stage 

It is of some interest that the late babbling period, i.e. the period of vocalization in the 
month (or so) prior to the onset of meaningful speech should be marked by CV syllables. 
This has often been taken as evidence that Jakobson's fundamental distinction between 
babbling and early: h was basically incorrect (cf. de Villiers and de Villiers 1974, 
Menn 19%2, etc.) The babbling period is described by Jakobson as containing "an 
astonishing arg and diversity of sound productions.” He cites Pap ea (1937) 
observation that at the height of the babbling period the child "is capable o producing all 
conceivable sound". The onset of meaningful speech, by contrast, is characterized 
according to Jakobson by a drastic reduction in the sounds produced.! He attributes this to 
the child's emerging system of phonemic oppositions. 

The claim of unconstrained babbling has since been challenged by many researchers (cf. 
Oller et al. 1976), who found that while infants do produce segments which are absent 
from the language of their particular linguistic community, such segments occur only 
occasionally. Preference seems instead to be given to those segments which also 
predominate early meaningful speech, i.e. stops and nasals. This has led most researchers 
to postulate a continuous transition from babbling to early speech rather than the abrupt 
qualitative change from chaotic sound production to structured vocalization hypothesized 
by Jakobson. This continuity seems to hold for syllable structure as well: For example, 


1 Ir should be mentioned that Jakobson seemed to restrict his observations to the 
production of segments rather than to syllable type. 
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Locke (1983) writes that the preference for CV syllables is “one of the more compelling 
patterns” in the babbling of infants as well as in early speech. Stoel-Gammon and Cooper 
(1985) report that in late babbling, CV syllables occur much more frequently than single 
V's or CVC syllables. This fact has also been pointed out by Cnuttenden (1970). 


3.1.2. CV Syllables in Early Meaningful Speech 

The evidence for the predominance of CV syllables in early child language is equally well- 
attested. Surveys of phonological processes in language acquisition (like Macken and 
Ferguson 1982, de Villiers and de Villiers 1974), etc. invariably point out this fact. To cite 
an example, Locke (1983) reports that in the 50 word vocalization of a Czech child, 92.8% 
of the items were CV syllables. 


The child's preference for CV syllables is attested by two phenomena: predominance of 
CV in spontaneous early utterances, and the structure changing processes which conform 
(adult) non-CV forms to CV patterns. The most commonly-cited processes in the literature 
in this regard are cluster reduction, final consonant deletion and reduplication, all 
characteristic of early child language cross-linguistically. Some examples from the 
literature are given below. 


7. (e)Final consonant deletion: 


English: 

bird --> bo (Ingram 1974) 

dog --> da (de Villiers and de Villiers 1974) 
goose --> gu (Branigan 1976) 

hi --> ha (Branigan 1976) 

bath --> ba (Locke 1983) 

French: 

place --> fa (Lewis 1936) 

Slovenian: 

bombon --> bo (Kolaric 1959) 


(b) Cluster-reduction by V-epenthesis: 


English: 
e.g. blue --> belu (Locke 1983) 


(c) Cluster reduction by C deletion: 
French: English: 
pied /pie/ --> pe (Lewis 1936) tree --> di (Smith 1982) 
taxi -> gegi (Smith 1982) 


Slovenian: 
mieko --> meko (Locke 1983) 
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(d) Beduplication of V in CVC forms: 
English (Ross 1937): 
back —> baga 


beach --> bihi 
dog --> dogo 


Each of these processes changes some adult sequence into a CV pattern. Under the 
hypothesis we are considering, the preference for CV syllables would be the result of an 
initial constraint on the mental representation of the syllable as defined by the Sonority 
Cycle. Support for our h thesis would be found if it could be shown that this preference 
is not somehow due to input, or to a constraint on articulation. While obviously such 
evidence is hard to find, there are some studies which suggest that our hypothesis is 
correct. Thus, in an early study of deaf children, Syken (1940) determined that CV 
lables predominate the babbling of deaf 3-4 year-old children, This would argue against 
hypothesis that CV syllables are preferred due to input (1c. their predominance in the 
speech of caretakers). Some support also comes from experiments on the perception of 
srg categories by young infants (2-6 months) which suggest that there is sensitivity to 
syllables even at that early a stage (cf. Eimas 1984, Kuhl 1980, Miller and Eimas 
1980). While this is only suggestive, such studies indicate that sensitivity to the CV 
syllable is to a certain degree independent of articulatory considerations, and would thus be 
consistent with the hypothesis that proclivity to certain syllable types is the result of a 
constraint on mental representation. 


3.1.3. Beyond CV Syllables 

Another fact which coheres with the prediction that the CV template constrains phonemic 
ition is the imbalance in the inventory of consonants in initial and final position, even 

after children produce CVC syllables freely. This has been noted by Stoel-Gammon 1985, 


will typically only have voiceless stops and perhaps one nasal in final position. Sioel- 
Gammon (1 


Furthermore, while stops, fricatives, and nasals appeared freely in initial demisyllables, the 
majority of consonants produced in final demisyllables (52%) consisted of /h/ and /?/. 
They also report a lower rate of accuracy in the production of consonants in fd's than in 
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consonants in sero paar This phenomenon can be explained under the view that CV 
but not VC is optimal by the Sonority Cycle. 


3.2. The Maximal Onset Principle 

We will now tum to some data supporting the Sonority Cycle as manifested by the 
Maximal Onset Principle. The data are from Stemberger (1988) who noted the following 
processes in the speech of his child Gwendolyn. 


3.2.1. Word-final resyliabification: 
G went through a stage where V-initial utterances were obligatorily preceded by an inserted 
glottal stop. n this became optional, there was a tendency for final consonants to be 
resyllabified with the os word if it began with a V. 
8. .find us --> [fai.nas] 
.-L00k at --> {je.tat] 
...amm is [au.mi:} 


3.2.2. H-fusion: 
From the age of 2;6.8 to 2;9.19 G resyllabified final voiceless stops with the following 
syllable if it began with /n/, ¢.g. 
9, ... about him --> [ba.thim] 
.. Fight here --> [wai.thi] 
..want hold --> [wa. tha:..] 


3.2.3. Liaison: 
G normally deleted word-final /d,z, nd, nz/. However, when the following word began in 
a vowel, she would pronounce them, resyllabified to id position. 
10. _—«.. head over --> [ha:. dou...] 

..untied it --> [2an. thai. dit] 

...Stand up --> [thi:. nap] 
In all the above, the strategy is to ‘repair’ initial demisyllables of the last optimal shape, i.e. 
GV or V (rank 4 and 5) by maximizing the onset with the addition of an obstruent or nasal, 
resulting in id's of rank 1 and 2. If this strategy turns out to be a common onc in child 
speech, this would provide strong evidence for a constraint on the shape of id's, as 


by the Sonority Cycle. 


3.3. The Syllable Contact Law. 

this section we present some data from Jenny, a child who participated in one of our 
ongoing longitudinal studies. Jenny was chosen because she did not have consonant 
clusters productively, and we were interested to find out if their ememrgence in her speech 
would in any way follow the ictions of the Sonority Cycle. Here we report only a 
small part of our findings, bearing evidence for what is predicted by the Sonority Cycle for 
transsyllabic consonant sequences. 


When we started the study, Jenny had no initial clusters, reduced certain medial clusters 

and had consonant + /s/ clusters in word-final position. Of interest to us are her medial 

oe Jenny reduced all her VO.OV sequences to V.OV. Some examples are shown 
ow. 
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lla. Jenny, age 3.2, Medial Clusters: 
VO.OV —> V.OV 


llb. VN.OV 
blanket --> [ban.tet] 
envelope --> [¢n.va.lot] 
rainbow --> [re*m.bo] 
dancing shoes --> [dan.sa if uz] 
monster --> [man.te] 


Obstruent deletion occurred both inside the word as well as across words, as can be seen 
from the last example in lla. At the same time, Jenny allowed VN.OV sequences freely, 
as seen in 11 b. Note that the consonants which were deleted from the VO.OV sequences 
were otherwise present in her speech, thus barring the possibility that an overall production 
constraint may have been at work. For example, /k/, produced as a retroflex t (velar 
fronting) occurred in words like _¢an, come, make, package, etc. /t/ in words like 

tea, cat, etc. /v/in envelope. The fact that she allowed VN.OV sequences 
furthermore shows that it was not a simple constraint against two consonant clusters. 
Rather, Jenny seemed to restrict her transyllabic clusters to the more optimal type, as 
defined by the Sonority Cycle, i.e. declining in sonority. 


4. Summary. 

The data presented in this paper, taken from several studies on phonological development 
suggests that child language acquisition is constrained by the same principles which have 
been found to hold on syllable structure cross-linguistically. In particular, we have 
focussed on the predominance of CV syllables in babbling and early meaningful speech, 
the imbalance of inventories of syllable-initial consonants as aang to syllable-final 
consonants, and several processes in child speech which have the ect of repairing certain 
syllable types which are defined as non-optimal by the Sonority Cycle. We have suggested 
that the hypothesis of the Sonority Cycle as a constraint on the mental representation of 
syllable structure can provide a unified explanation of these data. 
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AN ACOUSTIC ANALYSIS OF YOUNG CHILDREN’S 
PRODUCTIONS OF WORD STRESS 


Karen E. Pollock, Diane M. Brammer, and Carlin F. Hageman 
University of Northern Iowa 


Prosodic phenomena such as stress have tended to receive little 
attention in the literature on child phonology. Word stress, also 
sometimes called accent or stress-accent, is defined as giving 
prominence to one or more syllables of a multi-syllabic word. In adult 
English, stressed syllables can be identified by several acoustic 
characteristics: 

1. Intensity (stressed syllables may be louder). 

2. fesperee frequency (stressed syllables may be higher or Tower 

in pitch). 

3. Duration (stressed syllables may be louder). 

4, Frequency spectrum (vowel quality in stressed syllables may be 

more distinct). 
In adult English, the actual use of these characteristics varies 
greatly. Any combination of these factors may be used to mark the 
stressed syllable (or syllables) in a word. 

Not much is known about how or when children make use of these 
acoustic features to mark stress in words. Allen and Hawkins (1980) 
have suggested that children have particular difficulty in the reduction 
of unstressed syllables, that is, in decreasing the length or reducing 
the vowel quality of unstressed syllables. They observed that young 
children instead tend to either delete unstressed syllables altogether 
or to produce them with stress. 

In a study of five children between two and four years of age, Allen 
and Hawkins (1980) found that correct reduction of unstressed syllables 
ranged from only 35 to 65%. They also noted that the tendency to 
completely delete unstressed syllables occurred most often in word- 
initial position (e.g., [wel] for away). 

In another study, Allen and Hawkins (1980) looked at children’s use 
of fundamental frequency as well as duration. In words produced by 
three children approximately three years of age, stressed syllables were 
both longer in duration and higher in pitch. 

In both of these studies, real words were the focus of analysis. In 
later studies (Allen & Hawkins, 1980), nonsense words were constructed 
to allow for control over stress placement. Words were identical in 
segmental content, but differed in stress placement (e.g., [‘taki] and 
{ta’ki]}. Data from English speaking children from approximately three 
to seven years of age indicted that the children had little difficulty 
perceiving the difference between these pairs of words. However, some 
of the children had difficulty producing the words with second syllable 
stress (e.g., [ta’ki]). On the basis of these studies, Allen and 
Hawkins proposed that children are operating with a trochaic constraint, 
and are biased towards words with falling stress patterns. 
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Informal observations of young children’s early attempts at two- 
syllable words have suggested that such words are often produced with 
two equally stressed syllables or with stress on the wrong syllable 
(e.g., Leopold, 1947; Menn, 1976). Two descriptive studies of children 
approximately two years of age have also reported this use of level 
stress or misplaced stress. Klein (1984) looked at the multisyllabic 
productions of a single English-speaking two-year-old, and found a great 
deal of variability. Some words were produced with correct stress, some 
with misplaced stress, and others with level stress. Interestingly, 
newer words and imitations of novel words showed the greatest amount of 
variability. Words that were more established in the child’s vocabulary 
were produced with more consistent stress assignment. Hochberg (1988) 
locked at the production of stress in four Spanish-speaking children. 
Spanish has a more even distribution of words with final and penultimate 
stress than English. At the beginning of the study, when the children 
were 19 to 22 months of age, accuracy of stress placement was 
essentially at chance level. The children were no better or worse in 
accuracy on words with final versus penultimate stress. Over time, 
three of the children improved in their accuracy of stress production. 
On the basis of these data, Hochberg argued that children do not show a 
trochaic bias in their productions, but rather have a “neutral start" 
and are unbiased towards any particular stress type. 

Both Klein’s and Hochberg’s studies used perceptual transcriptions of 
stress. The analysis of data based on perceptual transcriptions is of 
some concern. Interjudge reliability of stress transcription with young 
children is typically poor. Brammer (1988) found average interjudge 
reliability for stress transcription increased with subject age, 
averaging 69% for two-year-olds, 74% for three-year-olds, and 83% for 
four-year-olds. One possible explanation for the poorer reliability 
with the younger children is that they may not be consisten. in their 
use of acoustic features for marking stress, and therefore may be 
providing the listeners with inconsistent or unreliable cues. 

The purpose of the present study was to investigate children’s use of 
three acoustic parameters in the production of two-syllable nonsense 
words. The specific questions addressed were: 

1) Which acoustic parameters (intensity, fundamental frequency, 
duration) are altered? 

2) Is there a difference in the use of these acoustic parameters 
across different age groups? 

3) Is there a difference in the use of these parameters when the 
stele syllable is in the first position versus the second position of 
the word? 


Method 
Subjects were six children each at two, three, and four years of age. 
All were screened for age-appropriate }anguage skills and normal hearing 
sensitivity. 
Eight novel CVCV words were created, controlling for segmental 
content and stress placement. All were of the shape CVCV. Two pairs of 
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syllables, [ba] - [fi] and [bo] - [da], were used. Words were created 
by altering the placement of these syllables within the word, and 
altering the placement of stress. The resulting experimental words are 
shown in Table 1. Referents for each word were unfamiliar objects to 
the children (e.g., a plastic grip for a bicycle handlebar). 

The experiment was conducted in a quiet room while the subject and 
experimenter played with both familiar toys and the unfamiliar objects. 
The excerimental words were elicited through an imitation procedure. 

The eight words were presented in a random order, with the restriction 
that words with identical segmental structure (e.g., {’boda} and 
{bo’da}]) were not presented consecutively. Each object was drawn from a 
bag by the examiner accompanied with a phrase, such as "Here’s a an 
With the younger children, spontaneous imitation of the experimental 
word often occurred. With the older subjects, imitations often needed 
to be prompted with the question, “What is it?" or "What’s its name?" 
Repeated presentations of the objects were made in an attempt to elicit 
at least three token productions of each word. However, it was not 
always possible to obtain three tokens for al} subjects, due to 
inattention to the task over time. As a result, a different number of 
productions were obtained for each subject. Table 2 shows the number of 
productions elicited for each age group for both first syllable stressed 
and second syllable stressed words. There were slightly fewer second 
syllable stressed words produced overall, due in part to the tendency of 
several two-year-old subjects to delete initial unstressed syllables. 

All experimental sessions were audiorecorded for later acoustic 
analysis. Productions that occurred more than 15 seconds after the 
adult’s mode] were not included in the analyses. Measures of peak 
fundamental frequency and peak intensity for each syllable were made 
using a Kay Elemetrics Mode] 6095 Visipitch. Measures of the absolute 
duration of each vowel were made from wide band spectrograms (600 Hz 
aralyzing filter) using a Kay Elemetrics Digital Sona Graph Model 7800. 


Table 1]. &xperimental words. Table 2. Number of words elicited. 
‘CVCV cv’CV Age ‘CVCV CVY’CY Total 
’bafi ba’ fi 2 67 54 121 
‘fiba fi'sa 3 70 65 136 
‘boda bo’da 4 74 81 155 
‘dabo da’bo 

a Total 211 201 412 
its 


The acoustic data were analyzed by three-factor mixed design analyses 
of variance (ANOVA) with repeated measures on two factors. A separate 


150 


aes 


143 


analysis was conducted for each of the three acoustic measures. Age 
(two, three, or four years) served as the between subjects variable. 
Stress level (stressed or unstressed) and syllable position (first or 
second) served as within subject variables. 

int Each subject’s mean peak intensity for stressed and 
unstressed syllables in first and second position is shown in Table 3. 
Results of the ANOVA for intensity revealed that the main effect of 
stress level was statistically significant, F(1,15) = 38.96, p < .001, 
indicating that the stressed syllable was more intense than the 
unstrecced syllable. The main effects of age and syllable position wer. 
not si. ‘ificant. However, a two-way interaction between age and stress 
level was Significant, F(2,15) = 7.86, p < .005. This interaction is 
displayed graphically in Figure 1. The graoh indicates that the 
stressed syllable was more intense than the unstressed syllable for the 
three- and four-year-olds, but not for the two-year-olds. No difference 
was apparent between the three- and four-year-olds. 

Fundamental Frequency. Table 4 shows each subject’s mean peak 
fundamental frequency for the stressed and unstressed syllables in first 
and second position. The results of the ANOVA found the main effect of 
stress level to be statistically significant, F(1,15) = 11.28, p < .005, 
confirming that the peak frequency of the stressed syllable was higher 
than that of the unstressed syllable. The ‘sain effects of age and 
syllable position were not significant. No interaction effects reached 
the significance level of .05. However, the two-way interaction between 
age and stress level approached significance, F(2,15) = 3.54, p = .055, 
which suggests that the peak frequency of the stressed syllable was not 
consistently higher for al] age groups. This interaction is shown in 
Figure 2, and suggests that the effect of stress is present for the 
three- and four-year-olds, but not for the two-year-olcs. No difference 
was observed between the three- and four-year-olds. 

The mean durat:on of the vowels in stressed and unstressed 
syllables in first and second position for each subject is shown in 
Table 5. The results of the duration ANOVA found the main effect of 
stress level to be statistically significant, F(1,15) = 127.02, 
p< .001, indicating that stressed syllahies were Le on the average 
than unstressed syllables. The main effect of syllable position was 
also significant, F(1,15) = 44.35, p < .001. That is, the duration of 
the second syllable was significantly longer than the duration of the 
first syllable. The main effect of age was not significant. 

As with the other measures, there was a significant interaction 
between age and stress level, F(2,15) = 8.53, p < .005. This 
interaction is displayed in Figure 3, which indicates that the effect of 
stress level is strongest for the three- and four-year-olds, although it 
is also present to a degree for the two-year-olds. The interaction 
between stress level and syllable position was aiso significant, F(1,15) 
= 52.06, p < .001. This interaction is shown in Figure 4, and indicates 
that the stressed syllable is longer if in the second position. 

Finally, the three-way interaction between age, stress level, and 
syliable position was significant, F(2,15) = 14.75, p < .001. Figure 5 
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Table 3. Mean intensity values (in dB) for each subject. 


Stressed Syllable 


Unstressed Syllable 


Age Subject First Second First Secon? 
2 ! 52,00 54.70 52.00 52.30 
2 49.30 48.50 53.30 44.60 

3 47.50 48.90 48.90 48.20 

4 48.40 46.50 49.70 50.30 

5 50.06 54.79 52.20 49.90 

6 50.10 54.00 52.20 50.30 

ra] 49.55 51.23 50.95 49.27 

SQ 1.56 3.68 1.78 2.64 

3 1 43.54 41.45 40.09 38.77 
2 50.00 49.90 47.60 85.20 

3 53.36 53.85 50.54 &7.00 

4 54.75 49.56 47.44 45.50 

5 58.85 57.00 57.25 57.69 

3) 54.45 62.75 54.25 48.55 

a 52.49 52.42 49.55 47.32 

3p §.22 7.27 6.03 6.12 

4 } 53.10 52.10 47.40 48.90 
2 54.50 54.93 53.20 51.50 

3 55.60 54.90 50.2} 51.50 

4 54.00 55.40 49.63 48.56 

5 46.79 48.85 §3.38 46.77 

6 46.92 52.54 40.08 48.54 

4 §].82 §3.12 47.32 49.86 

3D 3.93 2.50 4.82 2.86 
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Table 4. Mean fundamental frequency values (in Hz) for each subject. 


Stressed Syllable 


Unstressed Syllable 


Age Subject First Second First Second 
2 1 464.20 496.70 454.00 476.20 
2 325.80 317.20 367.30 297.50 
3 297.50 314.70 303.50 308.90 
4 289.50 250.00 275.80 300.60 
5 308.40 309.40 277.30 301.70 
6 353.40 316.00 326.70 388.70 
4 339.80 334.00 334.00 345.60 
A) 65.06 83.80 67.96 72.86 
3 l 282.90 259.80 270.10 248.30 
2 279.25 293.11 238.31 250.08 
3 279.40 286.73 270.36 247.50 
4 335.50 334.56 263.22 259.38 
5 270.83 277.67 259.08 282.17 
6 359.46 435.25 275.75 269.00 
] 301.22 314.52 279.46 259.41 
30 36.84 64.12 48.64 13.87 
4 l 284.80 293.30 263.80 255.90 
2 269.30 272.30 225.40 258.50 
3 277.90 269.90 240.20 280.00 
4 255,30 273.55 255.80 235.00 
5 301.50 300.00 285.28 280.46 
6 442.10 407.08 291.58 319.08 
8 305.15 302.69 260.34 274.49 
8p 68.84 52.61 25.52 28.85 
cr 
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Table 5. Mean duration values (in cs) for each subject. 


Stressed Syllable 


Unstressed Syllable 


Age Subject First Second First Second 
2 j 20.40 45.90 15.00 39.90 
2 23.60 37.80 22.50 33.90 

3 16.56 22.70 17.00 18.30 

4 14.20 26.50 17.50 15.70 

5 18.60 34.20 17.30 33.50 

6 19.60 31.70 20.80 17.50 

| 18.82 33.13 18.35 26.47 

§) 3.25 8.26 2.76 10.47 

3 i 22.77 33.64 14,73 30.04 
2 22.40 36.10 15.00 17.47 

3 17.81 28.17 10.7% 16.55 

4 19.25 34.81 14.25 17.00 

5 20.54 33.77 21.95 27.73 

6 16.32 22.00 8.50 16.73 

4 19.85 31.42 14.19 20.92 

a0 2.55 5.35 4.60 6.39 

4 1 17.60. 33.10 38.70 16.10 
2 22.40 36.10 15.00 19.47 

3 19.40 32.70 17.20 11,00 

4 18.19 35.30 21.96 B.40 

5 18.19 27.50 16.20 10.10 

6 15.20 40,00 24.50 36.30 

| 17.98 34.72 20.37 12.82 

8D 1.56 4.77 3.48 3,40 
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displays this interaction. For the two-year-olds, second syllables are 
still Yonger than first syllables, but stressed syllables are also 
longer than unstressed syllables, regardless of position. However, the 
difference between stressed and unstressed syllable duration appears 
greater in second position. For the four-year-olds, second syllables 
are longer than first syllables only if they are stressed. Second 
syllables are shorter when they are unstressed. 

It is interesting to look at the absolute duration values for the 
'CVCV targets. Across the age groups, the duration of the first vowel 
remains relatively constant, around 18 to 20 centiseconds. However, the 
absolute duration of the second vowel declines with increased age, from 
26 cs at two years, to 21 cs at three years, and finally to 13 cs at 
four years, indicating that the children are learning to reduce the 
length of unstressed syllables. 


Discussion 

The results of the present study suggest that al] three of the 
acoustic features measured (intensity, fundamental frequency, and 
duration) are rather poorly controlled at two years of age. At three 
years of age, both intensity and fundamental frequency are used 
consistently to mark the stressed syllable, regardless of its position 
in the word. However, it is not until four years of age that children 
seem to have control over duration as a feature for marking word stress. 

The present data also argue for a neutral start hypothesis for early 
stress production. The two year olds produced changes in intensity and 
fundamental frequency on individual tokens, but not consistently any 
direction. Instances of incorrect stress placement were seen not only 
in the use of first syllable stress for second syllable stressed 
targets, as would be predicted by the trochaic hypothesis, but also in 
the use of second syllable stress for first syllable stressed targets, 
in direct onposition to the trochaic hypothesis. 
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How Conservative are Children?: Evidence from Auxiliary Errors” 
Karin Stromswoid 
Massachusetts Institute of Technology and Harvard Medical School 


Introduction. The auxiliary system has received considerable attention from linguists and 
researchers studying language acquisition. One of the reasons for this attention is that the behavior of 
auxiliaries is very complicated; while certain syntactic and semantic traits are associated with many 
auxiliaries, some auxiliaries do not exhibit these traits. Linguists have tried to address the question of 


Researchers studying language acquisition have attempted to explain how children simultaneously learn 
the generalizations and restrictions on auxiliary order, combination, inflection, and placement (c.f, 
Baker, 1981, Pinker, 1984). In this paper, I will focus on how children acquire the auxiliary system. 


The child learning the English auxiliary system could adopt various strategies. At one 
extreme, she could acquire the auxiliary system productively, generalizing what she knows about the 
behavior of one auxiliary to another auxiliary. If she is too productive, however, she will make many 
mistakes. Consider, for example, what would happen if she generalized the behavior of do to can. if 


auxiliary system by rote, producing only those constructions which had been positively attested to in 
the input. If she adopted this strategy, she would make no errors. However, her acquisition of the 
auxiliary system would be extremely slow and protracted since she would never transfer what she knows 


In this paper, I make the following 3 point argument. In Section 1, I argue that the auxiliary 
system is so complicated that, if children were to generalize from one auxiliary to another, they would 
almost certainly m errors. In Section 2, I present the results of a search of a large corpus of 


1. Description of the Auxiliary System 


The following review of the auxiliary system is not meant to be a complete summary of the 
linguistic behavior of the auxiliary system. Rather, it is designed to give the reader a feel for the 
complexities of the auxiliary sysiem. 


Types of auxiliaries. 


There are 5 basic kinds of auxiliaries. The first subtype is the progressive be (e.g., the is in 


* This research was supported by a National Science Foundation Graduate Fellowship and a MacArthur 
Foundation M.D/Ph.D Fellowship awarded to the author. I would like to thank Steven Pinker and 
members of the CLRF audience for their insightful comments. I am also greatly indebted to Marie 
Coppola for her help with the transcript analyses. 
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she is eating). The second subtype is the passive be (e.g., the was in he was killed). Not all instances 
of be are auxiliaries, however. For example, the is in she is Aappy is a copula, not an auxiliary. 
Likewise the action proverb being in she was being naive is not an auxiliary. The third subtype of 
auxiliary is the aspectual have (¢.g., the have in they have been eating). Again not all instances of 
have are suxiliaries. The Aave in the sentence they have food is a main verb denoting possession. The 
fourth subtype of auxiliary is the auxiliary do (e.g., the do in the sentence / do not eat). The auxiliary 
do must be distinguished from the proverb do found in sentences like / do this. The fifth subtype of 
auxiliaries are the modal auxiliaries. The modals include can, could, will, would, shall, should, may, 
might, and must. The modals can and will must be distinguished from their homophonic main verb 
counterparts such as those found in the sentences he canned peas and he wisled her to come. 


In addition to these true auxiliaries, there are a group of verbs that exhibit some isolated 
auxiliary-like traits. I will refer to these verbs as pscudoauxiliaries. Examples of pseudosuxiliaries 
include the verbs need, better, gonna, wanna, and gotta. In many ways pseudoauxiliaries semantically 
and syntactically resemble modal auxiliaries. One can say, for example, you better go, you better not 
go, you better be going, you better have been going, etc.. If one replaces the word must for beer, the 
resulting sentences have the same structure and similar meaning as the sentences with better. Notice, 
however, that every pseudoauxiliary lacks certain behaviors which most modal auxiliaries exhibit. For 
example, need must be negated (¢.g., one can say you need not go but not *you need go). In addition, 
most pseudoauxiliaries cannot appear before a negation marker (¢.g., “you gonna not go, “you gotta not 
go,etc.), Finally, no pseudoauxiliary can appear before a subject (¢.g., “better you go?, “gonna you 
go?, etc.). 


Restriction on auxiliaries 


For ease of exposition, I will divide the restrictions on the behavior of auxiliaries into 4 types: 
inflectional restrictions, combination restrictions, order restrictions, and inversion restrictions. 


Infectional restrictions. With the exception of modal auxiliaries, all auxiliaries agree 
with their subjects (subject-verb agreement, or SVA). Thus, for example, one can say she is going but 
not *she are going. A second inflectional restriction is that the only auxiliary that can take the 
progressive -ing inflection is the passive be (for example, the being in he is being traded to the Mets). 
One can't say *musting, *ising, *didding, or *hadding. Furthermore, while having, or doing are 
acceptable, they are acceptable only as main verbs. 


Combination restrictions. In general, most of the possible combinations of auxiliaries 
are not acceptable. For example, one can say / could have eaten a horse, but one cannot say */ could 
eaten a horse. Even within subtypes of anxiliaries, some combinations are acceptable whereas others are 
not. For example, the combination of modal + aspectual Aave is acceptable for all modals except can. 
Sentences like */ can have eaten, *] can have been eating, *f can have been kicked, and *] can have been 
hungry are unacceptable even though these same sentences would be grammatical with any other modal. 
Another example of restrictions within a subtype of auxiliary is the cliticization of the negation marker 
not. The contracted form of nor (n's) can cliticize onto all forms of be except am (Le., one can say 
isn’t, aren't, wasn't, weren't, but not *ama‘t). Among the forms of the auxiliary do, only don’t is 
irregular (i.e., one says didn's and doesn’t but not *do-n't), All of the modals except will, shall, and 
may have regular contracted forms (i.e., one cannot say *wilin’t, *shalln’t, and *mayn't). 


Order restrictions. The subtypes of auxiliaries are strictly ordered as follows: 
<MODAL> <HAVE> < PROGRESSIVE BE> <PASSIVE BE> 


One can best appreciate this ordering in sentences which contain all of the subtypes of auxiliaries. 
Consider, for example, the sentence he must have been being tortured the entire time he was in jail. 
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Notice that if one switches the onder of any of the auxiliaries, the sentence is ungrammatical (¢.g., *he 
must have being been tor _..,¢tc.). A second order restriction is that if there is a tense or SVA 
inflection, it must appear on the first auxiliary. A third restriction is that, if the content of the entire 
sentence is to be negated, the negation marker must follow the first auxiliary.! 


Inversion Restrictions. Lastly, there are restrictions which determine whether a subject 
appears before an auxiliary or whether the order is inverted. In questions, the left-most auziliary and the 
sabject invert in matrix questions but not in embedded questions or in how come questions.” Thus, for 
example, when can he go? is a grammatical matrix question, whereas *when he cam go? is not In 
embedded questions and how come questions the suxilisry and subject do not invert. Thus, / wonder if 
he can go and how come he can go? are grammatical, whereas the inverted */ wonder if cam he go and 
*how come can he go? are ungrammatical. 


Things are not that simple, however. The first complication is that some non-auxiliaries can 
invert. In all modem dialects of English, the copula inverts. Thus, one can say is she happy? even 


happy to be the grammatical way to question the sentence she is happy. Additionally, in United 
Kingdom dialects of English, the main verb have can invert ( _ have you a match?).3 A second 
complication is that most pseudoauziliaries cannot invert (e.g., *better you go?, *gonna you go?, etc.). 


The above suggests that the behavior of auailiaries is extremely complicated. The auxiliary 
system seems to be filled with linguistic landmines waiting for the child who makes the wrong 
generalizations. Before I proceed to examine the errors children make, I will give a few examples of 


she did this, she would make many errors. One type of exror which is uniquely predicted by this model, 
is that she would either invert main verbs (if she generalizes the behavior of auxiliary verbs) or not 
invert auxiliary verbs (if she generalizes the behavior of main verbs). Another possibility is the child 
might notice that all auxiliaries exhibit subject-auxiliary inversion and from this conclude that all 
auxiliaries belong to a single subtype. If she did this, she would not make the inversion errors above, 
However, we would still expect her to either fail to add the SVA marker to nonmodals (if she generalized 
the behavior of modals) or add it to modals (if she generalized the behavior of nonmodals). 


1 One can say, for example, / could have not been late, but semantically what seems to be negated isn't 
the entire utterance, but rather the “been late” portion. 

2 The auxiliary can also occur before the subject in exclamatives (¢.g., can she cook!) and negative 
polarity sentences (e.g., Never have / seen such a mess). Because these forms are very rare in carly 
speech, they will not be discussed. 

3 It probably isn’: the case that U.K.-English possessive inverts because it is a full-fledged auxiliary. 
Unlike a true auxiliary, it can appear as the sole verb in a sentence (¢.g., they have apples). 
Furthermore, sentences in which possessive have precede an uncontracted not sound stilted at best (e.g., 
*2/ have not any apples). 
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2. Auxiliary Errors Made by Children 


In Section 1, I outlined some of the types of auxiliary errors we would expect children to make 
if they generalized the behavior of one auxiliary to another auxiliary. I conducted analyses of the 
auxiliary errors in children's speech to determine which, if any, auxiliary errors children make. 

Corpus. I searched for auxiliary errors in the ChiLDES transcripts (MacWhinney and Snow, 
1985) of the 14 children shown in Figure 1. I used the UNIX utility “fgrep” to cull all of the children’s 
lines which contained an auxiliary or a negation marker.4 This yielded a corpus which contained 
approximately 55,700 uses of auxiliaries? and 15,000 uses of negation markers. 


Figure 1: Child Transcripts Analyzed 


Bloom (1973): 
Brown (1973): 


Clark (1978): 
Higginson (1985): i 1;10-2;11 


1;3-1;9 
0;11-0;11 
MacWhinney & Snow 1;5-6;0 
(1985): 2;6-3; 


Sachs (1983): 
| Suppes (1973): 


Procedure. I went through the corpus of utterances which contained auxiliaries or negation 
markers looking for mistakes involving auxiliaries or negation. In addition, for each question that 
coniained a subject and an auxiliary, J determined whether the auxiliary was inverted. Utterances that had 
contracted, stuttered, or unclear auxiliaries and utterances which were obvious routines or imitations 
were not included in the ezror analyses or inversion analyses. An undergraduate research assistant did 


4 The computer searched for lines which contained the following letter strings: am, are, be, better, can, 
could, do, did, gonna, gotta, had, has, have, is, may, might, must, need, never, no, not, n't, shall, 
should, was, were, will, won't, and would. - 

5 Note that occurrences of copula be, and main verb have and do are included in these tallies. 
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the same analyses for a subset of the auxiliary and negation corpus. For all analyses, the concordance 
ratings were greater than 90%. 


Results 


A Inflectional Errors. 


lilici¢ Inflectional Endings. There was only one example of a child appending an illicit 
inflectional marker to an auxiliary. Children never applied the SVA marker to medals (i.¢., there were 
no examples like *he cans go). Likewise, children never applied the progressive -ing inflection to a 
modal (i.e., there were no examples like *musting). Furthemore, while the children frequently said 
doing, being, and having, there was one example in which a child added the progressive inflection to an 
auxiliary do, Aave, or be (Eve 2;0: while me [/] while me being sitting on it). 


Overregutarization. Many auxiliaries have past tense and SVA for:as which are irregular. 
Notice, for example, that one says she was eating rather than *she be-d eating and she is eating rather 
than *she be-s eating. It is well-known that children often go through a stage during which they say 
“eated” for “ate” and “sayes” for /sez/ (c.f., Pinker & Prince, 1988, for a review). Unless children are 
completely conservative in their acquisition of auxiliaries, we would expect them to occasionally 
produce overregularized past tense and SVA forms. I found no such examples. Children made no errors 
with modals (Le., they never said “canned” for “could”). I found 28 overregularized forms of do, have, 
and be, but in all 28 cases the overregularized verb was a main verb and not an auxillary. In addition, I 
found 134 cases in which children used be where they should have used is, ave, or am. In 128 of 134 
examples, be was 8 copula and not an auxiliary. 


In summary, the children made essentially no inflectional errors with auxiliaries. They made 
no auxiliary errors despite the fact that they inflected and overregularized main verbs which were 
homophonic to auxiliaries. 


B. Combination Errors. 


Illicit Combinations of Auxiliaries. As was discussed in Section 1, most 
combinations of auxiliaries are not allowed. Despite this fact, there were only 30 cases of what appear 
to be illicit combinations of auxiliaries in questions. All of the examples of illicit auxiliary 
combinations in questions invoived having an auxiliary both before and after the subject. Thirteen of 
the 30 examples had the copuls be (e.g., Adam (3:4): is it was a snake?) rather han a “true” auxiliary. 
Among declarative sentences, there were only a half dozen cases which could be examples of illicit 
combinations of auxiliaries, All but one of these examples involved a copula (i.¢., all but one were 
similar to Adam (2:7): you don’t be quiet). Given that the combination modal+aspectual Aave is 
acceptable for all modals except can, one might expect that errors like */ can have gone would be 
particularly common. This was not the case. I found 40 modal+aspectual Aave utterances but none of 
them used the modal can. This is despite the fact that the children used can more frequently than any 
other modal, 


Illicit Combinations of Auxiliaries and Inflections. In addition to most 
combinations of auxiliaries being ungrammatical, most combinations of auxiliaries and inflected main 
verbs are unacceptable. Therefore, I searched for ungrammatical combinations of auxiliaries and 
inflections. Lonly counted examples which were ungrammatical because of the presence of an 
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extraneous inflection or auxiliary (e.g., Adam (2;7) cowboy did fighting me).® I found that fewer than 
0.1% of children’s questions were doubie-tensed. Double-tensing was even rarer among declaratives. 
Furthermore, there were 10 cases like *does she going? whese do-support was incorrectly provided in 
questions. Likewise, judging from context, there were fewer than 20 cases where children incorrectly 
provided a do in an unemphatic declarative sentences. In other words, there were very few exaniples like 
1 do taste dem (Sarah; 3;0). 


Auxiliary + =’t. The negation marker can cliticize onto most but nos all auxiliaries; 
*amn’t, *do-n't, *mayn' t, *shalln't, *wilin’t are not acceptable. I found only one case of amn’: and one 
case of wilin’t among the almost 15,000 uses of negation markers. 


In sum, there were very few illicit combinations of auxiliaries despite over 55,000 possible 
oppostunities. The frequency of children’s auxiliary combination esrors does not appear to be 
dramatically greater than what one would expect to find in adult speech. When adults make suxiliary 
combination errors, these errors are obviously production errors and not the result of a faulty grammar. 
Since children's errors are no more frequent than adult errors, one could argue that they are production 
errors just like the adult errors. 


C. Word Order Errors Other than Inversion Errors. 


Auxiliary Order. In utterances that had more than one auziliary, children never scrambled 
the relative order of the auxiliaries. 


Misplaced Inflection. In all but 10 of the children’s utterances which had auxiliaries and a 
tense or SVA inflection. the inflectional marker appeared on the first auxiliary. 


Negation. Children placed a negation marker after a main verb in only 5 utterances, In 
other words, they essentially never said things like *he taking not all of de walls down (Adam, 3,5). 


D. Inversion Errors 


Which Verbs Can Invert. There were no examples of the children inverting 
pseu Joauxiliaries. In other wonds, they never asked questions like *Berter] go? In addition, I found 
only 10 questions in which children inverted a main verb (e.g., where goes one?). Conversely, children 
never placed aren't after / (e.g., */ aren't going) even though there were 22 questions in which children 
placed aren't before I (e.g., Aren't I gaing?). 


What Settings Allow Inversion. The children sometimes fa..ed to invert in settings 
which require inversion. In 8% of the matrix questions that had auxiliaries, children placed the subject 
before the auxiliary (e.g., Adam (3;3): What I will read?). Furthermore, all of the children 
who asked at least 26 anestions with auxiliaries made this mistake at least once. The children also 
sometimes inverted where they shouldn't have. They inverted subject and auxiliary in 6% of the 
embedded questions that had auxiliaries (e.g., Adam (4;3): I wonder what are dese for’?). In addition, 
they inverted subject and auxiliary in 14% of how come questions that had auxiliaries (¢.g., Adam 
(4:10): How come is dat?). 


In summary, inversion errors were by far the most common and pervasive type of auxiliary 
error, However, not all possible types of inversion errors occurred. For example, the children 


§ 1 did not count examples which were ungrammatical because an inflection or an auxiliary was 
missing (e¢.g..*she playing or *she is play) because such examples are likely to be the result of a 
production constraint. 
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essentiatly never inverted verbs which are uninvertible. They did make two types of inversion errors: 
they occasionally neglected to invert in settings that required inversion, and they permitted inversion in 
settings that do not allow inversion. 


3. Discussion 


In Section 1, | argued that the linguistic behavior of euziliaries is so complex that, if children 
generalized from one auxiliary to another, they would almost certainly make errors. In Sectioa 2, I 
presented the results of analyses of over 55,000 spontaneous uses of auxiliaries by children. The results 
of these analyses suggest that inversion errors are the only type of auxiliary error that children make 
with any appreciable frequency. 1 would like to argue that the scarcity of most types of auxiliary exrors 
suggests that children acquire most aspects of the auxiliary system conservatively. it is possible that 
this pattern of errors reflects different acquisitional strategies. These different acqnisitional strategies 
might in tum reflect a linguistic diffe. ence between most of the properties associated with the auxil.ary 
system and auxiliary inversion. 

This pattern of errors is consistent with the following linguistic mode!. The lexicon could 
contain a list of which inflected forms, combinations, and orders of suxiliaries are acceptable.” The 
lexicon could also contain a list of which auxiliaries can invert. Because these lists are finite, children 
could lear them essentially by rote. It is not possible, however, to list all of the sentence strings 
which permit the auxiliary to appear before the subject (¢.g.. is the man eating, is the big man eating, is 
the big ugly man eating, etc.). Inversion seems to be the result of a syntactic process which can occur 
in some settings and not others. Whether an invertible auxiliary does invert depends on restrictions on 
when this syntactic process can occur, Children must lea what these restrictions are. 


The rarity of most types of auxiliary errors is consistent with children acquiring the auxiliary 
system in a conservative fashion. The lack of errors does not, however, prove that children are 
conservative leamers. It could be that children do generalize, but that they are either extremely lucky or 
the generalizations they make are fairly conservative. Given that I searched over 50,000 uses of 
auxiliaries for errcrs, it is unlikely that the children just happened not to make most of the possible 
types of errors. A more likely possibility is that children are extremely smart about the generalizations 
they make. Notice that all the members °f the be subtyye, the do subtype, and the have subtype behave 
pretty much like one another. Children . . id fairly safely generalize within each of these subtypes. 


7 Schachter (1983) and Pinker (1984) have suggested that one feature determines which inflectional 
forms, combinations and orders of auxiliaries are acceptable. If they are right, then the lexicon would 
merely have to list the values for this one feature. This would simplify the lexicon and, hence, 
acquisition. Tt would also lead one to expect that inflection, combination and order errors would clustez. 
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Negation presents another seriaus problem for the nonconservative leamer. First, some 
auxiliaries can only appear with negation markers. For example, one can say you nezd not go, but one 
cannot ssy *you need go. Second, the negative marker cannot cliticize onto am or may, and shail and 
will have irregular negated forms. Third, aren't can appear before the pronoun / but not after it (e.g., 
aren't! going? but not */ aren't going). Finally, certain negated forms of auxiliaries have different 
connotations depending on whether they appear before or after the subject. 


In summary, the analysis of children’s auxiliary exsrors provides little or no evidence of a 
productive leaming strategy. However, it may be that there are no errors because children make exactly 
the right generalizations. In addition to looking at auxiliary ezrors, one may be able to determine 
whether children are conservative or productive by examining the acquisitional time course for the 
various auxiliaries. For example, one could determine hether all of the members of an auxilicry 
subtype exhibit certain linguistic behaviors at the same time. One could also determine whether 
auxiliaries that are acquired late immediately exhibit all of the traits associated with that subtype of 
auxiliary or whether these traits have to be leamed anew for each new modal. These analyses wold 
complement the error analyses presented here. Together they should help reveal how conservative 
children are in their acquisition of the auxiliary system. 
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CHILDREN'S PRODUCTION OF SUBJECTS: 
COMPETENCE, PERFORMANCE, AND THE NULL SUBJECT PARAMETER 


Virginia Valian 
Department of Psychology, Hunter College 
695 Park Avenue, New York, NY 10021 


When and how do children determine that their language does or does 
not require overt surface subjects? English requires an overt subject. 
For example (taken from Hyams, 1986), in English, (1) is grammatical 
while (2) is not allowed. In contrast, in Italian, both (3) and (4) 

(1} I am going to the movies 
(2) * Am going to the movies 
are allowed. There can be an empty or "null" subject in Italian. 
(3) Io vado al cinema 
I am going to the movie: 
(4) Vado al cinema 
Am going to the movies 

English and French are among the languages that require overt 
subjects; Italian and Spanish are among the languages that allow null 
subjects. The null subject parameter may also contro] other features of 
language features in addition to whether an overt subject is required 
(for a list of candidates, see Riemsdijk and Williams, 1986: 298-303). 
For example, languages that allow null subjects typically do not have 
“expletive”, or referentially empty, pronouns. 

An example, in English, of a referentially empty pronoun is the 
"it" in (5). In Italian, such a non-referential form of "it" does not 

(5) It seems that Jane loves Mary 
(6) Sembra que Jane ama Mary 
Seears that Jane loves Mary 
exist. The only equivalent of (5) in Italian is {6). Expletive "it" 
also occurs in “weather” expressions: “It's reining,” "It's snowing,” 
"It's cold,” and so on. 

The acquisition of the null subject parameter has received much 
recent attention, beginning with Hyams's (1986) original and pi wocative 
analysis. Hyams offered an explanation for an apparently ubiquitous 
phenomenon in American children's early speech: subjects ere often 
absent. The sheer ubiquity of this phenomenon, which has never received 
a satisfactory explanation, made it an excellent candidate for study. 

Hyams's (1986) explanation was that American children began 
acquisition with the null subject parameter set at the wreng value, the 
value which optionally allowed null subjects, and entailed other 
grammatical consequences. In addition to explaining why American 
children frequently omitted subjects, Hyams's (1986) account 
simultaneously explained other features of children's speech that 
apparently cooccurred with subject optionality. For example, her 
account predicted that children who omitted subjects would also fail to 
use expletive subjects when thes were required, and that expletives 
would enter the children's repertoire at the same time that subjects 
were used as required. In addition, her account predicted that children 
would fail to use Modals until they used subjects consistently. 
Previously reported data appeared to confirm both predictions. 
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What was so attractive about Hyams's (1986) analysis was that it 
accounted for a diverse body cf phenomena under one theory. The theory 
provided both a linguistic analysis of the null subject parameter, and 
an account of acquisition based on the linguistic analysis. There have 
since been a variety of competing explanations for American children's 
inconsistent use of subjects. 

Surprisingly, however, there are no numeric?) data bearing on 
American children's use of subjects. Although Hyams (1987) states that 
"it 4s well-known that thematic (referential) lexical subjects are 
optional in early child language and that expletive subjects are 
entirely lacking,” that statement {s incorrect. Existing accounts use 
either summary data reported in other contexts, or data from very small 
samples of children. This study presents the first quantitative data 
on American use of subjects, and the first comparison of American and 
Italian children's production of subjects. 

The data show that children's use of subjects has been erroneously 
described. Further, none of the current competence accounts of 
acquisition of the null subject parameter is supported. Finally, the 
data raise difficult methodological questions about the interpretation 
of inconsistent usage by children. 

Cross-sectional American data and longitudinal Italian data are 
compared. Twenty-one American children were audiotaped in conversations 
with their mothers, typically on two occasions no more than two weeks 
apart, for about 1 and 1/2 hours total. The children ranged in age from 
1:10 to 2;8. MLU was computed from each taping session, and the average 
MLU was used. This average MLU ranged from 1.53 to 4. The children 
were divided into 4 unequal groups on the basis of their MLU 
distribution. Group I consists of 5 children between MLU 1.53 and 1.99. 
Group II consists of 5 children between MLU 2.24 and 2.76. Group IIT 
consists of 8 children between MLU 3.07 and 3.72. Group IV consists of 
3 children between 4.12 and 4.38. In this sample of children MLU and 
age were highly correlated, r = 74, 

The children's speech was coded, and several categories were 
excluded from further analysis: utterances with unintelligible 
portions, single word assents or dissents, imitations, and routine 
utterances. In addition, for the analyses here, imperatives without 
subjects were excluded. Utterances were classified as imperatives on 
the basis of context. 

Table i shows what percentage of the children's non-imsperative, 
non-imitative, utterances with verbs contained subjects. The five 
children in Group I, between MLU 1.53 and 1.99, used subjxcts close to 
70% of the time when they produced utterances containing verbs. There 
is a shift in percentage of usage between Group I and Group II. Group 
II, whose MLUs range between 2.24 and 2.76, used subjects clese to 90% 
of the time. Performance was consistently high thereafter, increasing 
slightly in the next two groups. As the standard deviation indicates, 
Group I was the most variable group. 

The high consistency of usage in Group II suggests that children 
with an MLU of 2.8 or luss understand that English requires subjects, 
and can consistently eapress that understanding in their speech, if we 
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take 85-90% usage to be consistent usage.” "Earliness” is in the eye of 
the beholder, but such a high level of usage at this MLU indicates that, 
even in “early” child language, American children understand that their 
language is not a4 null subject language. 

The next question is how to interpret the 70% average figure for 
Group I, especially considering the variability in this group, where two 
children had subject use as low as 55%. There are two possibilities. 
One is that the child has net yet established the correct value. The 
child could think either that subjects are optional, or could be unsure 
about whether subjects are optional or obliatory. Either way, on 
possibility one, the child's competence is deficient. The second 
possibility is that the child understands that subjects are required, 
but is unable to express that understanding in her speech. On 
possibility two, the child's performance is deficient. 

How can one decide between the two possibilities? Two strategies 
can be used. One is to explore both competence and performance factors 
within English. The other is to Compare English-speaking children with 
children of a null subject language. The American data suggest that a 
performance explanation can handle all the facts, a conclusion supported 
by the Italian data, which show that American and Italian children 
differ markedly in their productions. 

To begin with the American children, we can examine other 
properties that the children's speech should have if they understand 
that subjects are obligatory, using the higher-MLU children as a 
benchmark. If Group ! children believed their language was a null 
subject language, one might expect them to use particularly few 
pronominal subjects, since it is pronominal subjects that can be 
omitted. The subjects that are expressed ghould be lexical rather than 
pronominal. Instead, a majority of Group 1 children's subjects are 
pronouns ~ 77%. Pronouns included personal] pronouns, demonstratives, 
and interrogatives. Pronoun use continues to be high, comprising 87% of 
Group II's subjects, 85% of Group III's, and 84% of Group IV's. 

Another potential competence measure is production of the expletive 
"it", But if our benchmark is the higher-MLU children, expletive usage 
cannot serve as a measure. For all 4 groups of children, even those who 
are consistently producing subjects, there is little expletive usage, 
and little usage of expletive contexts. Children of this age and at 
this level of development are seldom interested in talking about the 
weather. Other contexts that would support expletive usage require 
embedded structures that are act within the children's repertoire. 

To the extent that there is use of expletives, it occurs across the 
board. For example, the lowest MLU child produced an expletive: "when 
{t's noe-y,” where “noe-y" equals “snowy” or “snowing”. In fact, that 
was this child's only production of the word "it". Since even the 
contexts in which expletives would be required are rare, the lack of 
expletives is not meaningful. 

A third competence measure is presence of Modais, which, on some 
theories of the null subject parameter, could be related to production 
of subjects. Only the lowest MLU child failed to produce Modals. All 
others produced them, though usage was infrequent in Group I. Table 2 
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shows Moda] usage for each group. Usage is expressed by using the 
number of strings with Verbs as & baseline. Thus, the figures show what 
proportion of strings with Verbs included a Modal. Modal usage 
increases as a function of age and MLU, but there is no hint of a step 
function. That is, Modal usage does not suddenly begin when subject use 
becomes consistent, nor dramatically increase when subject use becomes 
consistent. Nor is Modal usage is correlated with subject usage, once 
MLU and age are partialled out (r = -~.03). Children's use of subjects 
is roughly constant regardless of how many Modais they produce. 

What about performance measures? The data I will report here 
complement P. Bloom's (1989). If children's usage of subjects is tied 
to their overall development and ability to produce longer and longer 
utterances, then there are certain correlations one might expect. For 
example, MLU, age, and subject use should correlate, and they do. In 
this sample, the simple correlation between MLU and subject use is 79 
(p < .001), and the simple correlation between age and subject use is 
.74 (p < .001). When a partial correlation between MLU and subject use 
is computed, the correlation is .48 (p = .03). Thus, the data suggest 
that children use 3ubjects more as their MLU increases. 

Another performance measure concerns correlations between Verb 
usage and subject usage. As Table 2 shows, children's Verb usage 
increases markedly from Group I to Group IV. Verb usage, unlike Modal 
usage, is highly correlated with subject usage, even when MLU and age 
are partialled out, r= .78, p< .001. As children produce more 
utterances with Verbs, they correspondingly produce subjects for those 
Verbs. Notice that Verb production is theoretically independent of 
subject use as measured here. The children could maintain @ constant 
proportion of subject usage as their Verb production goes up. 

The fact that subject usage and Verb usage are linked across the 
entire range of children suggests that children know that Verbs require 
subjects. As they are able to handle the complexity involved in 
including Verbs, they are correspondingly able to handle the complexity 
involved in including subjects. If the children believed that subject 
use was optional, their usage should remain roughly constant even though 
Verb usage increases. 

A final factor can be introduced to account for Group I's 
inconsistent production of subjects. Although subjects are 
grammatically required in English, in some contexts speakers onit 
subjects. An example from the NY Times is, "Seems like she always has 
something twin-related perking.” Other examples are “Want your lunch 
now?", “Having a good time?”. The American child hears examples of 
subject omission, and has to determine just what contexts acceptably 
allow omission (for more discussion, see Valian, 1989). 

To summarize, the child operates under performance limitations. 
She can also infer that it is acceptable to omit subjects in certain 
contexts (because she hears strings without subjects), without yet 
having zero'ed in on the contexts that support omission (because they 
are very complicated). As a result, subjects are absent too frequently. 

The second strategy for asseseing the significance of American 
children's very early inconsistent usage is to compare the children's 
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performance with that of young speakers of'a null subject language. 
Mazuka, Lust, Wakayama, and Snyder (1986), for example, looked at early 
Japanese usage. Italian data, consisting of longitudinal transcripts 
collected by G. Tirondola, and lent to me by F. Antinucci, provide an 
appropriate contrast. There are 5 Italian children, each 1;6-1;7 at the 
start of taping. They were recorded monthly for a year, except for one 
summer break, which occurred after the Sth taping. Each recording 
session was short, about 18-30 minutes. Because so few utterances are 
available, the 11 sessions are divided into Time I and Time II. 

Time I covers the first 5 months, before the summer break. The 
children were about 1;6 to 1;10 during that time. Time II covers the 
last 6 months, after the Summer break. The children were 2;0 to 2:5 
during that time. The Italian children at Time II were thus on the 
average somewhat older than the American children in Group I, who 
averaged 2;0. However, it would be desirable to have a linguistic way 
of comparing the children. Since MLU cannot be calculated in the same 
way with Italian children as with American children, it was not 
computed. Xe did calculate the number of utterances with Verbs, in the 
same way that we had for the American children. At Time I, 27% of the 
Italian children's utterances contained a Verb. That was the percentage 
for the American children in Group 1. At Time II, 39% of the Italian 
children's utterances contained a Verb. 

At both Time I and Time II, the Italians differ from even Group I 
Americans. Table 3 shows the children's use of subjects. The total 
proportion of non-imperative non-imitative strings containing subjects 
stays constant at about .30, less than half the average usage of Group I 
Americans. This shifts what has to be explained about the American data 
from, why do Group I Americans use subjects so little, to, why do Group 
I. Americans use subjects so much? The comparison suggests that Group | 
Americans already understand that subjects are syntactically required. 

To continue with the Italian children, at Time I the Italian 
children produce twice as Rény post-verbal as pre-verbal subjects, 
echoing Bates (1976). This may reflect the children’s initial 
comprehension that overt post-verbal subjects Carry focus information. 
By Time II the children have doubled their production of pre-verbal 
subjects, but total production of subjects retains the same from Time I 
to Time II. Again the Italians differ from the Americans, in that they 
do not increase their overall rate of subject production, within the 
time period sampled. 

The Italian children also produce less than half ss many pronominal 
subjects as the American children. Recall that for Group I of the 
American children, about 75% of the subjects were pronouns. At Time I 
22% of the Italian children's subjects were pronouns, and at Time II 35% 
were pronouns. At both Times I and II, pronouns were more common in 
post-verbal than pre-verbal position. This, again, may reflect the fact 
that post-verbal subjects tend to be the focus. 

With respect to expletive contexts, the Italian children did 
produce a few verbs that demanded the absence of a subject. At Time I 
there was one instance; at Time II there were a total of 14 instances, 
produced by 4 of the 5 children, involving 3 verbs: bisognare, piovere, 
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While this is a low rate of production, it is more frequent 
than is the case for the American children. This may, however, reflect 
the meanings of 2 of the 3 verbs: bisognare has to do with there being 
a need for something, and bastare with there being enough of something. 
With respect to Modals, the Italian children produced many fewer 
than the American children. At Time I there was one occurrence; at Time 
II 3 of the § children produced 7 examples. If Modal use is relativized 
to Verb use, as it was with the American children, that is a total of 
6/392 non-imperative utterances with Verbs, or a proportion of .015, 


about half the American production. 
In sum, the lowest-MLW group of American children does not look 


anything like either Time I or Time If Italian children. The American 
children produce, from the beginning, more subjects, more pronominal 
subjects, and more Modals. By itself, such data are not definitive, 
since the children may be matching the adult frequencies of usage. 
American children could think English is a null subject language, where 
the option of omitting the subject is infrequently used by speakers. 
what makes that interpretation less likely, however, is the fact that 
all children operate under production constraints. If American children 
believed their language was a null subject language, and needed to 
select constituents to omit, subjects would be an excellent choice, even 
if th> adult frequency were high. : 

Thus, it is hard to explain the differences between American and 
Italian children if we assume that both language groups think subjects 
are optional, and if we assume that both language groups Operate under 
production constraints. If we instead conclude that even the lowest MLU 
group of American children understands that subjects are syntactically 
required in English, we can explain their inconsistent usage as due to 
two factors: production constraints plus the existence of acceptable 
strings in English that lack subjects. 
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Table i 
American Children 


Proportion of Non-Isaperative Non-Imitative Utterances 
Containing a Verb Which Also Contain a Subject 


Mean Age Mean Proportion Subject (s.d.) 


Mean MLU 
Proportion Subject Range 


MLU Range Age Range 


Croup I (N = 5, Mean of 86 utterances with a Verb per child) 
1.77 2;0 .69 (.12) 
1.53-1.99 1;10-2;2 . .55 - .82 


Group II (N = 5, Mean of 207 utterances with a Verb per child) 


2.49 


2;5 .89 (.04) 
2.24-2.76. 2;3-2 


;8 .84 - .94 

Group III (N = 8, Mean of 286 utterances with a Verb per child) 
3.39 2;5 .93 (.04) 

3.07-3.72 2:3-2;6 .87 - .99 

Group IV. (N = 3, Mean cf 280 utterances with a Verb per child) 


4.22 2:7 .95 (.03) 
4.12-4.38 2;6-2;8 .92 - .97 
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Table 2 


American Children's Modal and Verb Usage 


Subject Usage Modal Usage® verb Usage” 
Group I .69 .03 .ar 
Group II .89 .06 .52 
Group III .33 , 09 .70 
Group IV 95 .14 .79 


Note. In all cases, strings with unintelligible segments, single word 
assents or dissents, imitations, or imperatives, were excluded. 


® For Modal Usage, the numerator is number of Modals and the denominator 
4s number of strings with Verbs. 


b For Verb Usage, the numerator is number of strings with Verbs and the 
denominator is number of strings. 


Table 3 
Italian Children 
Proportion of Non-Imperative Non-Imitative Utterances 


Containing a Verb Which Also Contain a Subject 


Pre-Verbal Post-Verbal Total 
(range) (range) (range) 
Time I (N = 5, Mean of 39 utterances with a Verb per child) 


.09 «ed .29 
(.05-.19) (.15-.24) {.22-.43) 


Time II (N = 5, Mean of 78 utterances with a Verb per child) 


245 14 .29 
(.08-.18) (.06-.23) (.18-.41) 
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